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TOPICS IN THE FUNCTIONAL CALCULUSt 


BY L. M. GRAVES 
Part I. THE THEORY OF FUNCTIONALS 


In this lecture it is proposed to outline an abstract theory of 
functionals, with a development paralleling that of the theory of 
functions of real variables, and including also a chapter on 
analytic functionals. In Part II, some applications of the gene- 
ral theory to various sorts of equations are indicated. 

From the abstract point of view, the functional calculus is a 
form of general analysis, and as such it was effectually initiated 
by Fréchet’s thesis in 1906. Since then a large number of re- 
searches have been concerned with the topological properties 
of abstract sets, and with the properties of continuous or semi- 
continuous functionals. 

The postulational basis for an abstract topological theory 
may take various forms. A general basis consists of a general, 
that is, unrestricted class ¥ of elements x, and an unrestricted 
function K on © to S, where G@ is the class of all subsets E of ¥. 
The function K is then a set-valued function of sets. The system 
(X, K) constitutes a topological space. It has been shown by 
Chittenden [7, pp. 294-298]t that a related set-function H 
may always be defined such that the space (¥, H) has the three 
properties: 

I. H(D+£) =H(D)+H(E£). 
II. For every set E, H(E) contains H(H(E)). 

III. If £ is finite, W(£) is null. 

Such a space (X, H) is called an accessible space by Fréchet. 

If the points of H(£) are called the points of accumulation of 
the set E, then closed sets may be defined as usual. A point of 
a set E is interior to E in case it is not a point of accumulation 
of the complement of Z. Open sets are those consisting only of 
interior points. The neighborhoods of a point x may be defined 
as those sets having «x as an interior point. A set E is called com- 


7 An address delivered by invitation of the program committee at the 
Chicago meeting of this Society, April 19, 1935. 
t References in brackets are to the bibliography at the end of the paper. 
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pact in case every infinite subset A of E has at least one point 
of accumulation, that is, H(A) is not null. 

Another basis for a topological theory consists of the system 
(X, So), where So is a subclass of the class © of all subsets E 
of %. If the sets Ep in the class So are thought of as the open 
subsets of X, we are led to definitions of point of accumulation 
and of derived set E’=K(E) which give us a system (X%, K). 
Sierpifiski [29, pp. 1, 28] has given five postulates for the sys- 
tem (¥, So) which are equivalent to the properties I, II, and III 
for the corresponding system (X, K). Hausdorff [17, p. 213] 
considered a system (¥, Go) in which the sets Eo in the class Gp 
are related as neighborhoods to the points of ¥. Hausdorff’s set 
of four postulates is more restrictive than the set given above. 

Still another basis for a topological theory is the system (class 
(2) of Fréchet) (¥, =., L), where 2, is a class of sequences of 
points of X, called convergent sequences, and L is a function on 
>. to ¥, assigning to each convergent sequence its /imit [Fréchet, 
10, p. 164]. Fréchet assumed: 

(a) If x,=x for every n, then L x, =x. 

(b) If L x,=x, then every subsequence converges and has 
the same limit. 

In such a system derived sets E’ = K(£) are readily defined, 
yielding a topological space (X, K). But it is possible for a com- 
pact sequence of distinct points (x,) to have only one point 
of accumulation x without being convergent. However, this 
inconvenience may always be removed by extending the range 
2. of definition of the function Z to a maximum without chang- 
ing the topological character of the space, that is, without 
changing the function K [Urysohn, 30]. If in addition derived 
sets are always closed, the space (¥, K) obtained from (%, 2., L) 
is an accessible space. If this additional condition does not hold, 
a related function H(£) may be defined by the process of Chit- 
tenden so that the resulting space (¥, H) is accessible. But a 
point of accumulation of a set E in the space (X, H) need not be 
the limit of any convergent sequence of points from E. For 
example, let ¥ be the space of all real-valued functions x(é) 
defined on 0</<1, and define L x,=x to mean ordinary con- 
vergence for each value of ¢. Then the related accessible space 
(X, H) defined by the process of Chittenden (which in this case 
is also a Hausdorff space) is such that the derived set H(E) of 
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a given set E contains all functions obtainable from those of E 
by repetitions of the limiting process. Thus if £ is the class of 
continuous functions, H(£) consists of all the Baire functions. 

The notion of limit of a sequence may be defined in any ac- 
cessible space (¥, H) thus: L x,=x in case every neighborhood 
of x contains all the points of the sequence from a certain place 
on. In the example just mentioned, this definition coincides 
with the original definition of limit. However, in a general ac- 
cessible space, a sequence may have more than one limit. In a 
Hausdorff space on the other hand, a sequence can have at most 
one limit. But it is still possible for a set E to have a point of 
accumulation which is limit of no sequence of points from E. 

In discussing functions defined on abstract spaces it seems 
convenient to refer to a numerically-valued function defined on 
an abstract space as an operation, and to use the word trans- 
formation to refer to general functional relations between two or 
more such spaces. 

A function or transformation f(x) which transforms an ac- 
cessible space ¥ into another accessible space ¥) is defined to be 
continuous at a point x of ¥ in case for every open set B con- 
taining f(x) there exists an open set A containing x such that 
the transform f(A) is contained in B. It follows that a con- 
tinuous function of a continuous function is continuous. Also a 
continuous transform of a connected set is connected. If the 
space X is compact and an operation f is continuous on %, then 
f is bounded on %. If every operation f continuous on % is 
bounded on %, then for each such f the set of its functional values 
is closed. As a corollary we may say that a real-valued operation 
f continuous on a closed compact set E has a maximum and a 
minimum on E. A Borel theorem may be stated for an accessible 
space as follows. Let the set E be compact and closed, and let § 
be a denumerable family of open sets O covering E, in the sense 
that each point of E is interior to a set O of the family §. Then 
there exists a finite subfamily of § which also covers E [see 
Hildebrandt, 20, pp. 470, 471; Chittenden, 7, p. 300]. 

A metric space (%, p) consists of a set ¥ of elements x and a 
real-valued operation p defined on ¥X, with the properties: 

(1) p(x1, x2) =0 if and only if x;=%e. 
(2) p(x1, X3) Sp(x1, x2) +p(xs, x2). 
It follows from these two properties that the operation p is 
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symmetric and non-negative. If we agree that L x,=x when 
lim p(x,, x)=0, we see that a metric space is a class (2) of 
Fréchet. It is also a Hausdorff space. It is plain that the dis- 
tance function may be altered without changing the topological 
character of the space. For example, we may obtain a new dis- 
tance function p* by setting p* =p/(1+ ). A fundamental prob- 
lem is that of determining the conditions under which a topo- 
logical space (X, K) is metrizable. That is, when does there exist 
a metric p defining the same points of accumulation for a set as 
the relation K? For certain types of topological spaces the prob- 
lem has been solved by Alexandroff and Urysohn, and by Chit- 
tenden [6]. 

In case the Cauchy condition, limm,n p(xm, Xn) =0, is sufficient 
to ensure the convergence of a sequence (x,) to a point x of a 
metric space (X, p), the space is called complete. The property of 
completeness may be lost by a change of metric, so that it is 
not a topological property. However, we may consider the 
topological property of “completeness for some metric p.” 
Fréchet prefers to apply the word “complete” to any topologi- 
cal space having this latter property. Likewise, from the topo- 
logical point of view, a space which is metrizable is called 
metric. In the sequel we shall be interested in many non-topo- 
logical properties, so that we shall use the term “metric space” 
to refer to a space (X, p) with a definite associated metric p, 
and the term “complete” will be used only with reference to 
that metric, and not in the Fréchet sense.{ 

In a metrizable space the Borel theorem holds without the 
restriction of denumerability of the original family § covering 
the set. Moreover, a continuous transformation f of a closed 
compact set in a metric space into a metric space is always uni- 
formly continuous. 

We have thus far surveyed the elementary ideas and theorems 
of point-set theory and the fundamental properties of continu- 
ous functions as they appear in the foundation of the abstract 
functional calculus or general analysis. Much of the theory of 
abstract sets has been passed by in order to adhere to the plan 
of outlining the analogies of the abstract functional calculus 


t Von Neumann has recently given an interesting discussion of the notion 
of completeness for linear spaces which may not be metrizable. See Transac- 
tions of this Society, vol. 37 (1935), pp. 1-20. 
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with the elementary theories of functions of real and complex 
variables. We shall turn next to the consideration of vector 
spaces, and then develop the differential calculus of abstract 
functions. 

A system (¥= [x], A=[a], +, -, | lI) which has the follow- 
ing properties will be called a normed linear space. 

1. & is the real number system or the complex number sys- 
tem. 

2. + is an associative transformation on XX to &. 

3. - is a transformation on AX to X, associative with multi- 
plication of numbers in %. 

4. - is distributive with respect to + and with respect to 
addition of numbers in %. 

5. 1-x=x for every x. 


6. || || is an operation on X to W. 
7. |[x1-+24l| 
8. |la-x|| = 


9. There is a unique point @ in ¥ such that || 0|| =0. 

A normed linear space is obviously a metric space if we set 
p(x1, x2) =||x1 — ell. In case such a space is also complete, it 
has been called by Fréchet “un espace (D) vectoriel complet” or 
a Banach space. Fréchet prefers to distinguish between the 
“points” of space and the “vectors” associated with ordered 
pairs of points [8; 10, pp. 123-146, 201-204]. Thus an “affine 
topological space” consists of a system (¥, K) with an associated 
system of vectors (Q =[£], +, -, | ll) having certain proper- 
ties. For most purposes I can see no gain in this distinction, 
since by selecting arbitrarily an “origin” 9 in the space ¥ a 
correspondence may be set up between the points x and vectors 
£. Omitting the distinction simplifies the notation and shortens 
the list of postulates. 

In this connection it should be noted that in certain inter- 
esting linear spaces (X, K) which are also metrizable, no metric 
can be defined in terms of a norm satisfying Postulate 8 along 
with all the others [Fréchet, 8, p. 50]. Two of these are the 
space Jt of all measurable functions with limit defined in terms 
of convergence in measure or approximate convergence, and 
the space (£,,) of Fréchet consisting of all infinite sequences of 
numbers, that is, points in infinitely many dimensions, with 
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limit defined in terms of convergence of each coordinate. For 
this reason Fréchet and Banach have considered vector spaces 
of a more general type than the normed linear spaces, and have 
derived a few properties of linear and polynomial transforma- 
tions of such spaces [9; 2, pp. 20-52]. However, this inter- 
mediate domain seems not to have aroused much interest. 

A transformation G(x) of a normed linear space ¥ into a 
normed linear space 9) is called linear in case 


G(a 1x1 + = a,G(x%1) + a2G(xe) 


for every pair of points x1, x2 and every pair of numbers a, de. 
In case such a transformation is continuous, it is also bounded 
or modular, that is, there is a number M such that ||G(x)|| 
< M||x|| for every x. The greatest lower bound of effective M’s 
is called the bound or modulus or norm of G and may be denoted 
by M(G). 

An additional concept which has been found useful in con- 
nection with linear spaces is that of weak convergence, as dis- 
tinguished from convergence in terms of the norm. Schauder 
has proposed the following list of postulates to characterize the 
concept [27, p. 664]. Let ¥ be a normed linear space, and let 
wL be a single-valued function defined on a class 2, of se- 
quences of elements of ¥, and with functional values in ¥. The 
sequences of =, are called weakly convergent. The properties as- 
sumed are: 

(a) If lim ||x,—x|| =0, then wL x, =x. 

(b) If wL x,=x, wl x, =x’, lim a,=a, lim a, =a’, then 
WL +a, x, )=ax+a’x’. 

(c) If a sequence (x,) is weakly convergent, the sequence of 
norms (|lxn ) is bounded. 

(d) If wZ x, =x and ||x,|| < M, then ||x|| <M. 

(e) Every subsequence of a weakly convergent sequence has 
the same weak limit. 

Such a system (X, 2.,, wL) is plainly a class (2) of Fréchet. 

In a given space, various definitions of weak convergence 
having the specified properties are usually possible. For ex- 
ample, weak convergence may always be identified with con- 
vergence in the norm. In this case the concept would be sterile. 
An important property which may sometimes be obtained by a 
proper choice of the definition is: (C..) every bounded set is 
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weakly compact, that is, every sequence which is bounded in 
the norm has a weakly convergent subsequence. 
In case we define weak convergence as follows: wl x,=<x in 
case lim f(x,) =f(x) for every continuous linear operation f de- 
fined on X, it may be shown that the weak limit x of a sequence 
(x,) belongs to the extension of the set [x,] to be linear and 
closed [Banach, 2, p. 134]. Also every continuous linear trans- 
formation G is weakly continuous, that is, wl G(x,) =G(x) 
whenever wl x, =x [2, p. 143]. 
A functional transformation G of a set Xo into part or all of a 
space ¥) is commonly called completely continuous in case it is 
continuous and transforms every bounded subset E of Xo into a 
compact set G(Z). Let us say that G is totally continuous in 
case it transforms every sequence which is weakly convergent 
according to the definition just given into a sequence convergent 
according to the norm. It may be shown by examples that 
neither of these properties implies the other. However, if every 
bounded subset E of Xo is weakly compact, then a totally con- 
tinuous transformation G of Xo is also completely continuous. 
On the other hand a completely continuous linear transforma- 
tion G is also totally continuous [Banach, 2, p. 143]. 
It may not be out of place to remark here that the spaces of 
“modular” functions considered by E. H. Moore in his second 
general analysis theory are all normed linear spaces. Professor 
Moore defined a sequence of modular functions £, to be con- 
vergent “in mode one” in case the norms are bounded, and the 
sequence £,(p) converges for each p. He showed that con- 
vergence in mode one is equivalent to weak convergence as 
defined above, and that every bounded set of modular functions 
is always weakly compact. 
An important theorem on sets of functional operations is the 
following. Let the sequence of operations g,(x) defined on the 
complete normed linear space ¥ have the following properties: 
(1) | | |ar| | + |a2| |gn(xe) |. 
(2) For every 1, | gn(x) | is continuous in x. 
(3) For every x, g,(x) is bounded with respect to n. 
Then the operations | £n(X) | are continuous in x uniformly with 
respect to 2. A more general form of this theorem was proved by 
Hildebrandtf [19]. An immediate consequence of the theorem is 

} The general form of the theorem follows readily by an indirect proof 
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the proposition that the limit f(x) of a sequence of linear con- 
tinuous transformations f,(x) convergent for every point of the 
space X is also continuous. Another consequence is the proposi- 
tion that a multilinear transformation which is continuous in 
each of its arguments separately is continuous in all its argu- 
ments together. 

There are several conceptions of differentiation which are 
useful in the functional calculus. Perhaps the first to be used was 
the very general notion of a variation, which gave to the cal- 
culus of variations its name. This notion has passed through 
several stages, and we shall notice only one modern form of it. 
Let f be a functional transformation defined on an open set Xo 
in a normed linear space ¥, with functional values in another 
such space 9). Suppose that 

(D,) for every element 5x in ¥ there exists the derivative 

=0 


f(x + adx) = 6f(x, 6x). 
da 


This function 6f(x, 6x) is then called the variation of the function 
f at the point x. 

Note that we have used here the notion of the derivative of a 
function g(a) of a numerical variable whose functional values 
lie in a normed linear space. The definition of such a derivative 
is exactly analogous to the one used for numerically-valued 
functions and it has similar properties. A like statement is true 
concerning the Riemann integral of such a function g(a) [see 
Graves, 13], and we shall have several occasions to use such 
integrals. Various definitions have also been given for the cor- 
responding generalization of the Lebesgue integral, but we shall 
not need them. 

It may readily be shown that the variation 6f(x, 5x) of a func- 
tional transformation f(x) as defined by statement (D:) always 
has the property that it is homogeneous of the first degree in 
dx. But it may fail to have many of the usual properties asso- 


from the special case stated above. The special case in turn is an easy corollary 
of the theorem that a complete metric space is of the second category, and the 
following theorem: If the sequence of non-negative and lower semi-continuous 
operations f,,(x) is bounded with respect to m for each point x of a set Xo of 
the second category, then there is a sphere K in the space ¥ on which the opera- 
tions f,(x) are bounded uniformly with respect to x and n [Banach, 2, p. 19]. 


= 
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ciated with derivatives of functions of a single variable, as is 
shown by the fact that every function f(x) homogeneous of the 
first degree has a variation at the origin, 6f(0, 6x) =f(6x). In 
order to obtain a theory of differentiation with more content, 
we may suppose that the variation 6f(x, 6x) of a functional 
transformation f(x) has one or more of the following additional 
properties: 

(De) 5f(x, 6x) is linear in 6x. 

(Ds) 5f(x, 6x) is continuous in 6x. 


(Dg) 5f(x, 6x) is defined and continuous with respect to x 
at each point x in Xo uniformly for ||6x|| =1. 

In case a property (D) holds for every point x of Xo we shall 
denote the corresponding property by (D’). It may be shown 
that a function 6f(x, ix) having the relation to f(x) expressed in 
properties (D2) and (D,) always has property (D;) also. More- 
over, properties (D/ ), (D?), and (Dj) imply (D2), provided the 
space ¥) of functional values is complete. When (D2), (D3), and 
(D,) are satisfied, we say that the function f has a fotal differ- 
ential 5f(x, 5x) at the point x. The importance of the notion of 
total differential has been emphasized by Fréchet on various 
occasions. For example, a function f having a total differential 
at a point x is always continuous at x9. Moreover, if also g(z) 
has a total differential at zo, and g(zo) =o, then the function 
flg(z)] has a total differential at zo, equal to 5f[ x0, 5g(zo, 5z) J. 
These theorems do not hold for functions having only variations 
in the weak sense expressed by condition (D,) alone. However, 
we shall see later that when the function f is continuous and the 
derivative in (D,) is taken with respect to a complex variable 
a, then property (D7 ) implies all the others, provided again that 
the space 9) of functional values is complete. 

The differentiability properties of a functional operation or 
transformation f(x) may change when the continuity properties 
of the space ¥ are changed by altering the norm ||x||. For an 
example we may take a calculus of variations integral 


fle] = f 46, », 
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where the space ¥ consists of all functions x(t) of class C’ and 
vanishing at ‘=0 and f=1, and ||x|| = max | x(¢) |. Then 


1 
df[x, dx) + dt 
0 


has properties (D,) and (D2). Property (D3) also holds when x(t) 
is of class C’’, but an example shows that it may fail if x(¢) is 
merely of class C’. Property (D,) fails in general, since the func- 
tional operation f[x] is not usually even continuous. However, 
if we set ||x||= max {| x(t)|, |x’(¢)|}, then in our example 
df[x, 6x] has all of properties (D/), (Dj), (Di), (Dg) in a 
suitably chosen region Xo, and the operation f[x] is itself con- 
tinuous. 

The variety of useful definitions of differentiation is greater 
when we consider variations of higher order. There is time to 
mention only a few. We shall say that a transformation f(x) 
has an mth variation 6"f(x, 6x) at a point x in case for every 5x 
there exists the mth derivative 


a=0 


= 6*f(x, 6x). 


| 
| 


bx 


The ath variation 6"f(x, 6x) is always homogeneous of degree n 
in 6x. Moreover, if we set 


1 
Razi(%, bx) = f(x + bx) — f(x) — 6x) — 6"f(x; 6x), 
n! 
we have 
|| adx)|| 0 
a=0 a 


There is valid also a generalization of Taylor’s theorem [Graves, 
13, p. 173], namely, if f(x) is defined and has an nth variation 
at each point of a convex region Xo, and if the space 9) in which 
the functional values lie is complete, and if x and x+4x lie in 
Xo, and 6"f(x+ t6x, 5x) is a continuous function of ¢, then 


R,(x, 6x) = + téx, — #)™"'dt/(n — 1)!. 
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We might also say that a transformation f(x) has a second 
Variation at Xo in case f has a first variation 5f(x, 6x) for each x 
near Xo, and 6f(x, 6x) has a first variation §f(xo, 5x, 5:x) at xo 
for every 6x. The mth variation would then be defined by induc- 
tion, and would involve m independent variations 6,x, - - - , 5,x. 
In case d5f(x, 5x) has the properties (D/), (Dj), and (D{/) of a 
total differential, and 6°f(xo, 6x, has the properties (D2), 
(D;), and (D4) for each 6x, according to Fréchet 6?f(xo, 5x, 51x) 
is called the second differential of f at xo. In this case it may be 
shown that 6?f(xo, 6x, 6:x) is a symmetric function of 6x, 6,x, 
and hence that it is linear and continuous in 6x and in 6,x. 
When the space ¥ is complete, it follows that 6?f(xo, 6x, 5x) 
is continuous in 6x, 6x together. 

A function f(x:, - - - , x,) of a finite number of real variables 
is commonly said to be of class C‘” on an open region in the 
x-space when it has continuous partial derivatives on that region 
up to and including those of the mth order. We may generalize 
this notion to our abstract situation as follows [Hildebrandt 
and Graves, 18, pp. 135-144]. We say that f(x) is of class C’ 
on an open region Xo in case f has a total differential 5f(x, 5x) 
at every point of Xo, which also satisfies condition (D;’). The 
function f is of class C™ on Xo if f is of class C’ and 6f(x, 5x) 
is of class C~1) uniformly for || 6x||=1. When n23, this im- 
plies, for example, that 5*f(x, 6.x, 5.x, 63x) is continuous in 63x 
uniformly for ||5,x|] =||Sex||=1. Using this definition it is pos- 
sible to prove, for example, that if f(x) is of class C(™ on Xo, and 
if g(z) is of class C‘” on Zo and has its functional values in Xo 
then the function A(z) =f[g(z)] is of class C™ on Zo [Hilde- 
brandt and Graves, 18, p. 144]. While the proof of this theorem is 
rather difficult, the other formal rules of the differential cal- 
culus which have meaning in this general situation are readily 
derived. 

In case ¥ and 9) are complete normed linear spaces with the 
complex number system 9% as the associated number system, we 
may develop an interesting theory of analytic functional trans- 
formations f on a region Xo to 9). The essential features of this 
were indicated in certain special cases by Gateaux [11, 12]. 
Let us consider first the case of a function f(z) on Ao to 9), where 
Ao is an open set in the complex plane. The definition of de- 
rivative of f(z) and of ordinary point for f(z) have the usual 
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form. A function f(z) is said to be holomorphic on a set S in case 
every point of S is an ordinary point for f(z). Line integrals of 
such functions have the usual elementary properties, and 
Cauchy’s theorem and integral formula hold true. A Laurent 
expansion for f(z) about an isolated singularity may be derived 
in the usual way, and the notion of analytic continuation also 
generalizes. All this was indicated by Wiener [31]. 

Let us turn now to the general case of a functional trans- 
formation f(x) defined on an open set Xo in a normed linear 
space. We say that a point x» is an ordinary point for f in case f 
is continuous and has a first variation at each point x near xo. 
If each point of Xo is an ordinary point for f(x), then f is holo- 
morphic on Xo. 

A transformation f(x) on ¥ to 9) is called a polynomial trans- 
formation in case f(x;+ Axe) is a polynomial in the numerical 
variable \ with coefficients in the space 9), for every x; and x2 
in X. This definition reduces to the ordinary meaning of the 
word polynomial in case f is a numerically-valued function of a 
finite number of numerical variables. A continuous and homo- 
geneous polynomial transformation P,(x) of degree m may be 
polarized to give a multilinear form Q(x, x2, - - - , x») linear and 
continuous in each of its arguments, such that the relation 
Q(x, ---,x)=P,(x) holds. The kth differential of P,,(x) is then 
QO(6\x,---, x, ---, x) if Rn, and vanishes if k>n. 

Let the transformation f(x) be defined by a series of the form 


(1) f(x) = (x). 


There is a theorem on the nature of the region of convergence of 
such a series, analogous to Abel’s theorem for ordinary power 
series. Let X 9 denote the open set remaining after discarding the 
boundary points of the set on which the series (1) converges. 
Then if the point x is in Xo and | A | <1, the point dx is also in 
Xo. We shall call a region Xo having this property a region of 
circular type about the origin. In order to obtain interesting 
properties, we assume that the series (1) converges uniformly 
on sets E which are compact on Xo. An equivalent condition 
states that every point x» of the region Xo has a neighborhood on 
which the series converges uniformly. It follows that the sum 


0 
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f(x) of the series is holomorphic on Xo, and in fact of class 
om. 

Conversely, we have also a Taylor’s theorem. That is, if a 
transformation f(x) is holomorphic on X 9, then about each point 
xo in the region Xo the transformation f(x) is expansible in a 
series 


0 


of continuous homogeneous polynomial functions of (x—<x»), 
converging uniformly on a neighborhood of xo, so that f(x) is 
necessarily of class C‘). In case Xo is a region of circular type 
about the point xo, the Taylor expansion (2) for f(x) converges 
uniformly on sets E compact on Xo. Also ILF(x)|I, being con- 
tinuous, is bounded by a constant M for \|x—29 <p, where p 
is sufficiently small, so that by Cauchy’s integral formula, 


||  — < — 


The sum of a series of functional transformations, f(x) 
=) fa(x), where each f,(x) is holomorphic on Xo, is also holo- 
morphic on Xo, provided the series converges uniformly on sets 
E compact on Xo. Moreover, if f on Xo to 9) is holomorphic on 
Xo and g on U, to Xo is holomorphic on Uo, then the trans- 
formation h(u) =f[g(u)] is holomorphic on Up. If f(x, u) is a 
function of two points x and u which is continuous in x and u 
together and holomorphic with respect to x and u separately 
for x in Xo and uw in Up, then f(x, u) is a holomorphic function 
of the composite variable (x, u), and conversely. Finally, we 
may make use of the process of analytic continuation. However, 
we do not expect to secure much information about the nature 
of the singularities of an analytic functional transformation 


f(x). 


Part II. ImpLticir FUNCTION THEOREMS AND APPLICATIONS 


We may apply the preceding theory to study the solutions of 
various sorts of equations. Thus it may be desired to solve the 
equations 


film, Yn) = 0, (i = n), 


— 
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for the y’s as functions of the x’s. This system of equations 
may be written f(x, y) =0, where the range of x is a region of the 
m-dimensional x-space, the range of y is a region of the n-di- 
mensional y-space, and the functional values of f are in the y- 
space. The differential equation f(x, y, dy/dx) =0 has for the 
unknown a function y=y(x) in the space of functions having 
continuous derivatives. The dependence of the solution y(x) on 
the function f may be investigated by the methods of the func- 
tional calculus [Bliss, 4]. From our general point of view, 
algebraic equations, differential equations, integral equations, 
and many other types are all gathered under one head. How- 
ever, we may still wish to consider a variety of hypotheses, and 
there are different methods available. 

The simplest method*is that of successive approximations. 
Let (2), p) be a complete metric space, and let (yo), denote 
the neighborhood of yo consisting of those points y for which 
p(y, Yo) <a. Consider an equation of the form 


(3) y = F(y), 
where the transformation F is a function on (yo)4 to 2) with the 
properties: 


(a) It satisfies a Lipschitz condition with constant k< 1, that 
is, p(F(y1), F(y2)) 

(b) p(F(yo), yo) <(1—k)a. 

Then the method of successive approximations shows at once 
that there is a unique solution of equation (3) in the neighbor- 
hood (yo)a; that is, there is a unique fixed point of the trans- 
formation j = F(y). In case the transformation F depends on a 
parameter x, which may be a point in an accessible or a metric 
space, the solution of equation (3) becomes a function y= Y(x), 
and the theorem is properly called an implicit function theorem. 
If the function F(x, y) is continuous in its two arguments, then 
the solution y= Y(x) will also be continuous [Hildebrandt and 
Graves, 18, pp. 133-135]. 

Suppose we wish to consider an equation of the form G(y) = 9, 
where G is a transformation defined on a neighborhood (yo) 
with functional values in a space 3, and 9) and 3 are complete 
normed linear spaces. We may apply a generalization of New- 
ton’s method to obtain a solution (a process which is equivalent 
to reducing the equation to the form (3) and applying the 
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method of successive approximations), provided: (a) the point 
yo is a sufficiently close approximation to a solution of the equa- 
tion, (b) the function G has a differential 5G(yo, dy) at yo, and 
(c) this linear transformation 5G(yo, 5y) on Y) to 3 has an inverse. 

The question of the existence and properties of an inverse for 
a linear transformation G has been the subject of extensive in- 
vestigations. I shall mention only a few elementary theorems. 
The first states that if a continuous linear transformation G has 
an inverse H, then H is also linear and continuous [Banach, 2, 
p. 41]. The second states that if the space 9) is complete, and if 
the continuous linear transformation G of 9) into all or part of 
itself has norm M(G) less than unity, then the transformation 
K(y) =y—G(y) always has an inverse which may be expressed 
in terms of the usual series of iterates of G [Banach, 1, p. 161; 
Hildebrandt and Graves, 18, p. 145]. The third may be stated 
as follows. Let the space 9) be complete, and let G(y, w) be a 
continuous linear transformation of 9) into 3 which depends also 
on a parameter w ranging over an accessible space. Suppose that 
G is continuous in w at w=w» uniformly for ||y|| =1, and that 
G(y, wo) has an inverse. Then for w in a sufficiently restricted 
neighborhood of wo, G(y, w) has an inverse H(z, w), and H is 
continuous at w=w uniformly for ||z||=1 [Hildebrandt and 
Graves, 18, p. 146]. 

A general implicit function theorem may be stated as fol- 
lows. Let X, 9), and 3 be normed linear spaces, of which 9) and 
3 are also complete. Suppose that G(x,y), with functional 
values in 3, is of class C™ on a region R in the composite space 
(¥, 9), and let (xo, yo) be an initial solution of the equation 


(4) G(x, y) = 9 


at which the partial differential 6,G(xo, yo; by) has an inverse. 
Then in a sufficiently small neighborhood (yo) the equation (4) 
has a unique solution y= Y(x) defined on a neighborhood (x9)s, 
and the solution Y(x) is also of class C‘™ [Hildebrandt and 
Graves, 18, p. 150]. 

From our discussion of analytic functional transformations it 
is plain that when the transformation G(x, y) is analytic, the 
solution y = Y(x) is also analytic. 

It is frequently of interest to know how far a solution y = Y(x) 
of equation (4) may be extended. Let us define a sheet of points 
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in the composite space (X, 9)) to be an arcwise connected set 
S with the additional property that for every point (x,¥:) 
of S there are positive constants a and b such that no two points 
of S in the neighborhood (x, y:)2 have the same projection x, 
and every point x in the neighborhood (x), is the projection of a 
point (x, y) of S contained in (x;, y:)4. Then under the hypoth- 
eses of the theorem just stated there is a uniquely determined 
maximal sheet S of solutions of equation (4) which (a) passes 
through the initial solution (xo, yo); (b) is composed entirely 
of points (x, y) at which the differential 5,G(x, y; dy) has an in- 
verse; and (c) has for its boundary points only boundary points 
of the region R and points at which 6,G(x, y; dy) ceases to have 
an inverse. This theorem is readily proved by defining S as the 
logical sum of all sheets having the properties (a) and (b). It 
may be noted that a sheet of solutions determines a single- 
valued function Y(x) in a neighborhood of each of its points, and 
each of these functions is of class C‘™. 

Various applications of the general implicit function theorem 
described above have been made to differential, integral, and 
integro-differential equations, and to the calculus of variations 
[Graves, 14, pp. 535 ff.; 15; 16, p. 677]. The theorem is a very 
powerful one, and simple in application. The principal problem 
in any special case is to determine when the partial differential 
5,G(x, y; 6y) hasan inverse. Let us consider a simple application 
to an ordinary differential equation with boundary conditions, 


(5) y”’ = hit, y, y’), y(a) = A, y(b) = B. 


Let 9) be the space of all real functions y(t) of class C’’ on 
a <t<b, with ||y|| defined in the usual way as the maximum of 
| y(t) |, | y’(#) | y’"(t) | on the interval. Let be the space of 
all real functions h(t, y, y’) continuous in ¢ and of class C‘” in 
y and y’, for (t, y, y’) in a certain bounded region R. Let ||h|| be 
the maximum of | d+ih/dy'dy’® | for (t, y, y’) in R and i4+7 
=0,---,m. Let 3 be the composite space consisting of all 
continuous functions 2(f) and pairs of numbers A, B, with 
\|z, A, B|| =maximum of | 2(¢) |, | A |, | B|. Then the equations 


yy)=20, ya)=A, = B, 


define a transformation G(y, 1) of class C‘™ on a region of the 
composite space (9), ) to the space 3. In this special problem 


— 
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a necessary and sufficient condition that the partial differential 
5,G(y, h;5y) have an inverse transformation is that the equation 
of variation 


(6) by” — — = 0, dy(a) = 0 = 


be incompatible, that is, have only the solution 6y=0. Hence if 
an initial solution of equation (5) is known at which (6) is in- 
compatible, this solution may be continued, allowing the func- 
tion h and the end values A and B to vary until a point is 
reached which lies on the boundary of the region of admissible 
points, or at which equation (6) becomes compatible. This solu- 
tion y(t) = Y[h, A, B|t] will be of class C™ as a function of h, 
A, and B. 

This result may be applied to the calculus of variations as 
follows. Let y= y(t) be an extremal for an integral 


b 
f S(t, y, y’)dt 


joining two points (a, A) and (6, B). Suppose that 
(7) Sy Sut, Suy 


have continuous partial derivatives up to order m with respect 
to y and y’, and let ||f|| be defined as the maximum of the ab- 
solute values of the partial derivatives (7) and their partial 
derivatives up to order m. Suppose that f,,+0 along the 
initial extremal, and that the end points of this extrema! are 
not conjugate. Then when A, B, and the integrand f are varied, 
the extremal joining (a, A) and (0, B) will vary continuously and 
be of class C‘™ as a function of f, A, and B, and the process may 
be continued until an extremal arc is reached which either is 
singular, that is, has f,,y=0 at some point, or else has end 
points which are conjugate. If the initial extremal is a minimiz- 
ing extremal, then the varied extremal will continue to furnish 
at least a weak relative minimum for the integral. 

The method described may be applied also to partial differ- 
ential equations of elliptic type. An essential theorem for that 
purpose is the one which states that if a linear elliptic partial 
differential equation of the second order has at most one solu- 
tion taking given boundary values, then it always has a solution 
for arbitrary continuous boundary values, at least when the 
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shape of the boundary is sufficiently restricted. Schauder has 
recently given a proof of this based on the functional calculus 
and on the known solvability of the equation 


(8) D(u) = u,, + ty = f(x, y), (u = ¢ on the boundary), 


and known inequalities affecting the solutions [28]. Schauder 
gives the theorems for the case of m independent variables, but 
for simplicity they are stated here for the case =2. Inequali- 
ties affecting the solutions of the equation 


(9) L(u) = a(x, y)u,, + 2b(x, + ¥)ty = f(x, ¥), 
(u = ¢ on the boundary), 


where a>0, ac—b?=1, are first derived. All the functions 
appearing in (9) are supposed to be a-Hélder continuous, 
(0 <a <1), and the norms || ||f||, and ||¢|| involve the Hélder 
constants of those functions. The coefficients a, b, and c are 
supposed to be 8-Hélder continuous, where 6B >a. The inequali- 
ties proved yield a type of weak compactness of the set of solu- 
tions of the one-parameter family of equations 


(1 — A)D(u) + AL(u) 


(10) (A, PA) 


(wu = ¢ on the boundary), 


where P(A) is a certain function which is continuous and not 
zero for 0S <1, and ||f|| is bounded. Hence if (10) has a solu- 
tion “=u, for \=X,, and X, approaches \, then a subsequence 
of the sequence u, is weakly convergent to a solution u which 
also satisfies the inequalities. Now the transformation K(A, u) 
is continuous in \ and uw and linear in u, so that it will continue 
to have an inverse for \ near a value Xo for which it is known 
to have an inverse. Since the family (10) reduces to equation (8) 
for \=0 and to equation (9) for \=1, and equation (8) always 
has a unique solution, it follows that equation (9) always has a 
unique solution. Let the inverse transformation so defined be 
denoted by u=7(f). Then the general elliptic linear equation 


(11) L(w) + d(x, + e(x, y)uy + g(x, yu = h(x, 9), 
(u = ¢ on the boundary), 
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may be transformed into 

(12) f=k—d—T(f) —e—T(f) — 
Ox oy 


and the transformation S(f) turns out to be completely continu- 
ous. Thus by use of the Riesz theory of linear transformations 
of the form (12), where S(f) is completely continuous [Riesz, 
24; Hildebrandt, 21; Schauder, 26], it follows that if (11) has 
at most one solution, then it always has a solution. 

The foregoing proofs are made first for the case when the 
boundary values ¢ are supposed to have continuous second de- 
rivatives satisfying a Hélder condition, and the theorems are 
extended to the case of arbitrary continuous boundary values 
by use of certain additional inequalities. This proof by Schauder 
is a good example of the advantages offered by the general meth- 
ods of the functional calculus. No use is made of an elementary 
solution or of Green’s function. 

A number of other fixed point theorems and implicit function 
theorems have been proved for general spaces by topological 
methods. We remark that topological methods yield no informa- 
tion about the differentiability of functions defined implicitly. 

Throughout the following paragraphs it is understood that 
the spaces ¥ and 9) are complete normed linear spaces. 

A very general fixed point theorem is as follows. Let f be a 
continuous transformation of a convex closed set S in the space 
X into a part of itself, such that the transformed set f(S) is 
compact. Then the transformation f leaves one point of S fixed. 
A fixed point theorem for n-dimensional space was proved by 
Brouwer. The proof of the general theorem depends on an ex- 
tension of the Brouwer theorem, and is made by defining trans- 
formations f, which approximate the transformation f on n- 
dimensional subsets of the set S. The first extension to function 
spaces was made by Birkhoff and Kellogg in 1922 [3]. Theorems 
for general spaces were proved later by Schauder [25]. 

The following theorem on inversion of functional transforma- 
tions is readily proved. Let y = F(x) be a transformation defined 
and continuous on an arcwise connected region Xo of the space 
%, which is locally invertible, that is, the equation y = F(x) has 
a unique continuous solution near each of its solutions. Suppose 
also that whenever the sequence F(x,) converges, the sequence 
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(x,) is compact on Xo. Then F is a topological transformation 
of Xo into the whole space 9). It is clear that a fixed point 
theorem for a transformation G(x) may be obtained as a corol- 
lary by setting F(x) =x—G(x). The theorem may be proved as 
follows. Let Yo=F(Xo0), and let (y,) be a sequence of points of 
Yo converging to a point y’. Then the corresponding sequence 
(x,) is compact on Xo and hence there is a subsequence (xp,) 
having a limit x’ in Xo. Since F is continuous, y’ = F(x’). Hence 
the set Yo is closed. It is also open, since the transformation is 
invertible near each of its points. Hence Yo=¥). If the inverse 
transformation were not single-valued, there would be a curve 
B in Xo joining two distinct points whose image C in the space 9) 
is a closed curve. By a continuous deformation of the curve C 
into a point the curve B must also be deformed into a point, 
which easily leads to a contradiction. Essentially this theorem 
and proof were given by Paul Levy in 1920 [23]. Caccioppoli 
has recently given an exposition of it in general form [5]. 

Schauder has recently proved theorems showing that under 
certain circumstances the uniqueness of the solution of an equa- 
tion y = G(x) implies the existence of a solution (x, y) for every y 
near yo, where (xo, yo) is a solution. One of his theorems is as 
follows [27, p. 686]. Let there be defined in the space ¥ a weak 
convergence satisfying the postulates set down in Part I. Sup- 
pose also that every bounded set is weakly compact. Let F and 
H be transformations defined on an open set Xo of X, and let F 
be totally continuous, in the sense that it transforms a weakly 
convergent sequence into a sequence converging according to the 
norm. Suppose that // satisfies a Lipschitz condition with con- 
stant k<1, and transforms a weakly convergent sequence into 
a weakly convergent sequence. Then if G(x) =x+ F(x)+H(x) 
establishes a one-to-one correspondence between the open set 
Xo and the transformed set G(X), G(Xo0) is also an open set. 
Again the proof is made by constructing approximating trans- 
formations defined in subspaces of a finite number of dimen- 
sions. Schauder applied this theorem to show that in case a par- 
tial differential equation of elliptic type 


F(x, y, 2, p,q, 7, S, t) = W(x, y), (z = ¢ on the boundary), 


is known to have at most one solution, and if it has a solution 
for Y =o, 6 =¢o, then it has a solution for Y, @ near Wo, do. 
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Another implicit function theorem has been given by Leray 
and Schauder [22] for equations of the form 


(13) x = F(x, k). 


Let R be the closure of a bounded open set R in a space %, and 
let K be a finite closed segment of the real axis. Let the trans- 
formation F be defined and continuous on RK, and continuous 
in k uniformly with respect to x. Moreover, let the set of func- 
tional values F(R, K) be compact, and suppose equation (13) 
has no solution with x on the boundary of R. Suppose that at 
an initial point ko of the interval K, the equation (13) has a 
finite number of solutions x =x;, and that the sum of the “in- 
dices” of these solutions is not zero. This last hypothesis is satis- 
fied, for example, in case there are an odd number of solutions 
near each of which the equation y =x — F(x, ko) has at most one 
solution. Then equation (13) has a continuum of solutions (x, k) 
on which takes all the values of the interval K. This theorem 
has the advantage of being a theorem in the large, and of not 
requiring uniqueness of the initial solution, but the other hy- 
potheses are rather stringent. The proof again depends on trans- 
formations F, which approximate the transformation F on sub- 
sets of R of a finite number of dimensions. 

All of the theorems which have been quoted have their appli- 
cations to boundary value problems for differential equations 
and to integral equations, both linear and non-linear. The field 
seems to be a fruitful one for further study. 
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ON A THEOREM OF PLESSNER 
BY W. C. RANDELSt 
Plessner{ has shown that if f(x) ¢ Lz on (—7, 7) and 


f(x) + (ad, cos nx + b, sin nx), 


n=1 
then 
(1) > (a, cos nx + b, sin nx)(log n)~1/2 
n=2 


converges almost everywhere on (—7, 7). We designate the set 
where (1) converges by E(PI, f). This set is then known to be 
of measure 27. The sets E(F, f), consisting of the points where 


g(t) = f(x +4) + f(x — — 2f(x) 0, as t- 0,7 
and E(L, f), consisting of the points where 


&(t) -{ | o(r)| dr = o(f), as 


are of much importance in the theory of Fourier series. The set 
E(L, f) is known to be of measure 27 for all integrable functions. 
It is obvious that 


E(f, t)c EZ, f). 


We propose in this note to investigate the inclusion relation- 
ships between these sets and E(PI, f). We shall prove 


(2) E(F, f) ¢ E(Pl, f), 
and 
(3) E(Pl, f)¢ E(L, f). 


We first consider (2). Plessner§ showed that, if (1) converges, 


{ Sterling Research Fellow. 
t A. Plessner, Journal fiir Mathematik, vol. 155 (1926), pp. 15-25, 
§ Loc. cit., p. 22, 


= 
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(4) S,(x) =— ae > (a, cos vx + b, sin yx) = of (log m)il2} 


v=1 


However, it is well known that, for a continuous function, the 
estimate 


(5) Sn(x) = o(log 1) 


cannot be improved. This implies that (4) need not be satisfied 
at every point of continuity and hence 


E(F, f) ¢ E(PI, f). 


In order to prove (3) we shall construct a function f(x) ¢ L. 
on (—7, 7) for which (1) converges at x - 0 but such that 


(6) f | o(r)| dr ¥ o(t) as t 0. 


The function is similar to one constructed by Paleytf for another 
purpose. We define f(x) by 


(x{(« — n)n log n}™, if =| — = 


f(x) = 


lo, elsewhere on (0, z), 
\f(— x) forO > x= — 


Then, since 
fi f(x) |'dx (log 2) | 
=O n— (log nyt, 


n=1 


f(x) cL. on (—7, 7). We have at x=0, ate =2f(t) and 


a> (n? log n)- —= 


n=[1/t]+1 n=[1/t]+1 


+ P. Du Bois Reymond, Abhandlungen der Bayerische Akademie, vol. 12, 
part 2 (1876). There is a simplification of Du Bois Reymond’s method given 
by Lebesgue, Lecons sur les Séries Trigonométriques, 1906, pp. 84-86. 

t R. E. A. C. Paley, Proceedings Cambridge Philosophical Society, vol. 26 
(1930), pp. 173-203; see §10, pp. 201-203. 


(n = 3,4,---), 


1935] A THEOREM OF PLESSNER 665 


for ¢ sufficiently small. Now we consider S,,(0). It is well known 


that 
1 sf sin (m + 1/2)t 
S»(0) = o(t) dt 
2 sin t/2 
-= fs dt + O(1), 
while 
sin 
Sm*(0) = 
1 2 sin mt 
_sinmt 
n'+n* sin mt 
But 
sin mt = sin (mn) cos (m(t — n—)) 
+ sin (m(¢ — cos (mn), 
so that 


* sin mt 


TSn*(0) = 4 (n log cos (ne) 


n=3 n=[m/ n=[m/7}41 
= 1, + I, + Is. 
Now, if ma<1,a>b>0, 
@ sin a ma sin mt 
dt = O{ma}, 


and, if mb>1, a>b>0, 


2 sin mt ma sin ¢ f 1 
J dt -{ dt = O <—? = O(1). 
b t mb t Amb 
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3 m3 


n=3 m logn 
mil? 
I,=0O (n log 
n=[m"/9}41 
= O} log log m‘!? — log log m'*} = O(1) 


I;=0O {m > (n log 


Therefore 
Sm(0) = S,*(0) + O(1) = O(1). 
We now apply Abel’s partial summation to (1) and get 
> (S, — Sn-1)(log 
= Si(log 2)-"/? + { (log — [log + 1)]-¥?}, 


n=2 


since S,,(0) =O(1) and (log 2)—!/?-0 as n—~. But since log n 
is monotone, 


Ms 


{ | (log n)-¥/2 — (log [n + 1]-1/2| } = (log 2)-'/?, 


n 


and therefore (7) converges. This means that the point x=0 

is contained in E(PI, f). But since we have already seen that 

the point x =0 is not contained in E(L, f), this proves that 
f)¢ f). 


YALE UNIVERSITY 


Hence 
=0 = o(1). 
n=[m"/2}41 J 
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A REMARK ON METHOD IN 
TRANSFINITE ALGEBRAt 


BY MAX ZORN{ 


The theorems of Steinitz concerning algebraic closure and the 
degree of transcendence are barred, from the algebraic point of 
view, by the well-ordering theorem and its theory. We wish to 
show how, by introducing a certain axiom on sets of sets instead 
of the well-ordering theorem, one is enabled to make the proofs 
shorter and more algebraic. The proofs will be given in terms 
of the non-axiomatic standpoint of set theory. 

DEFINITION 1. A set 8={B} of sets B is called a chain, if 
for every two sets B,, Be, either B, > Bz or Bz > By. 

DEFINITION 2. A set % of sets A is said to be closed (right- 
closed), if it contains the union >-z>8B of every chain $ con- 
tained in %. 

Then our maximum principle is expressible in the following 
form. 

(MP). In a closed set X of sets A there exists at least one, A*, 
not contained as a proper subset in any other AEX. 

App.icatTions. I. Let ® be a ring with a unity element 1; let 
W be the set of all ideals a (i) not containing 1 as an element, 
(ii) containing a certain ideal r~R. The set % is obviously 
closed; the maximum principle implies the existence of a maxi- 
mal ideal p with r € p ¢ ® $ py; this ideal isa prime ideal and 
the residue class ring R/p is a field. 

II. If & is a real field, that is, a field such that no sum of 
squares vanishes unless all the squares vanish, and K is an arbi- 
trary algebraical extension, then the set of all real fields between 
k and K is closed, so the MP assures the existence of a maximal 
real field between k and K. In particular, if K is algebraically 
closed, we obtain a real closure of k. 

III. Let K be an arbitrary field extension of k. A set of K- 
elements {a} is said to be algebraically (respectively, linearly) 
independent, if no finite subset a, d2, @3, - - - , Gn satisfies an 


+ Presented to the Society, October 27, 1934. 
t Sterling Fellow. 
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algebraic (linear) relation with coefficients in k (not all vanish- 
ing). The set of all these independent sets is closed; a maximal 
independent system, which exists as a consequence of the MP, 
is a basis for K/k with respect to algebraic (linear) dependence. 
(The theorems concerning dependence are consequences of the 
dependence axiomsf only.) 

We shall first prove the existence of an algebraic, algebraically 
closed field extension. Indeed, we shall construct for each set of 
polynomials { p(x) } with coefficients in k a minimal decomposi- 
tion field. (Let p have the highest coefficient 1 and degree n,.) 

We form the field 2’ of all rational functions in x over k. Over 
k’ we consider the domain of all polynomials in the following 
set of indeterminates: y,,;, the first index p ranging over all poly- 
nomials of our set, the second over the integers from 1 to mp. 

In this domain of integrity let us take the ideal generated by 
the polynomials 


| P(x) — (2 — — Joa) --- Jon} 


Suppose that this ideal does not contain the number 1. Then we 
have a prime ideal containing it, and the residue classes of the 
indeterminates y generate a minimal field over k, in which all 
polynomials p(x) are completely decomposed. 

If now this ideal should contain the unity element, we should 
have a representation of 1 as a linear combination of a finite 
set of differences p(x) —(x—vyp,1) - - - with coefficients rational 
in x and integral in the y’s. This relation in indeterminates y 
gives the contradiction 1 equal to 0, if we substitute for the 
indeterminates the roots of the finite set of polynomials p oc- 
curring in the relation; these roots exist in accordance with ele- 
mentary theorems in a suitable field extension. 

Let K,, Kz be two algebraic extensions of k, which are mini- 
mal decomposition fields for the set of polynomials { (x)} over 
k. We consider the partial realizations of the uniqueness theo- 
rem, that is, the isomorphisms between fields K? and K,’, which 
are the identity in the common subfield 2, K? lying in K;. The 
function tables of these isomorphisms, considered as sets of 
pairs, form a closed system (of sets of pairs); therefore a maxi- 
mal one must exist in accordance with MP. 


7 Van der Waerden, Moderne Algebra, vol, 1, p. 204. 
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This maximal isomorphism must range over the entire fields 
K;. For let K? be the fields connected by the maximal isomor- 
phism. Take an arbitrary polynomial p(x) in the set {p}. The 
minimal decomposition fields for this polynomial p(x) over K?, 
generated by adjoining respectively the roots in K;, are isomor- 
phic under an isomorphism which continues our maximal isomor- 
phism between K/° and K.°. (As in the case of the existence these 
proofs are based on the corresponding finite theorems; precisely 
as in the proof dealing with the degree of transcendence.) Hence 
the decomposition fields reduce to K* ; K decomposes all poly- 
nomials of the set and is therefore identical with K;; our maxi- 
mal isomorphism yields the uniqueness of the minimal decom- 
position field, in particular, the uniqueness of algebraic closure. 

Finally we prove the existence of the degree of transcendence. 
Let {x} and {y} be two bases of K/k. Once more we consider 
the partial realizations of our theorem. Namely, we take a (1, 1) 
correspondence between subsets {x}’ and {y}’ of {x} and {y}, 
where {x}’ and {y}/ are dependent. 

The function tables of these correspondences, considered as 
sets of pairs, form a closed system. We shall prove by the corre- 
sponding finite theorem concerning degree of transcendence and 
the axiom of choice that a maximal correspondence of this kind 
which exists in accordance to MP must range over the entire 
bases {x} and {y}. Take a correspondence with {x}’ different 
from {x}. Take a finite set of x’s not contained in {x}’. This 
will depend on {y}’ and a certain finite set of y’s which is not 
contained in {y}’. This finite set of y’s depends on {x}’, the 
arbitrarily chosen finite set of x’s and a certain other finite set 
of new x’s; proceeding in zigzag we obtain (choice axiom) two 
sequences of new x’s and y’s; we may proceed similarly, in case 
of a finite number (finite theorem) or an infinite (denumerable) 
set. It is not necessary to show the utility of the MP in further 
algebraic or analytic examples. 

In another paper I shall discuss the relations between MP, 
the axiom of choice, and the wel!-ordering theorem. I shall show 
that they are equivalent if the axiom yielding the set of all sub- 
sets of a set is available. It seems that without this axiom the 
MP is weaker than the well-ordering theorem; the situation may 
be studied by means of other axioms. 

In the present paper all the axioms concerning sets are used; 
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in the case of algebraic closure there is a stronger result of 
Hollkott (Hamburg): the axiom of choice is sufficient for the 
existence and uniqueness of algebraic closure. 

An essential simplification is made possible for Baer’st theory 
of the degree of algebraic extensions. I plan to show elsewhere 
how the generalized continuum hypothesis may be avoided. 


YALE UNIVERSITY 


CONCERNING TWO INTERNAL PROPERTIES 
OF PLANE CONTINUAT 


BY R. E. BASYE 


Theorem 1 below was suggested to me by R. L. Moore. 
Theorem 2 is an extension of Kuratowski’s result§ that if three 
compact plane continua have a point in common and their sum 
separates a point A from a point B in the plane, then there 
exists a pair of these continua whose sum separates A from B 
in the plane. Another extension of this result along com- 
binatorial lines has been given|| by Cech. 


THEOREM 1. Let H and K be two mutually exclusive and closed 
subsets of a compact continuum M which lies in the plane. If for 
each pair of points A and B in H and K, respectively, there exists 
a finite collection 4p of continua in M such that 4p separates A 
from Bin M, then there exists a finite collection T of continua in M 
such that T* separates H from K in M. 


Let 1, €2, - - - , be a sequence of positive numbers converging 
monotonically to zero, with e less than half the distance from 
H to K. For each i let Dj be a domain containing H such that 
(1) the boundary $j of Dj is the sum of a finite number of 
mutually exclusive simple closed curves, and (2) each point of 


{ Eine Anwendung der Kontinuumhypothese in der Algebra. Journal fiir 
Mathematik, vol. 162. 

t Presented to the Society, April 6, 1935, under a somewhat different title. 

§ Kuratowski, Théoréme sur trois continus, Monatshefte fiir Mathematik 
und Physik, vol. 36 (1929), pp. 77-80. 

|| E. Cech, Trois théorémes sur I’'homologie, Publications de la Faculté des 
Sciences de L’Université Masaryk, No. 144, 1931, pp. 1-21. 


— 
= 
= 
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Dj lies at a distance less than e; from H. For each i let Di bea 
similar domain containing K and having boundary Bi. We 
shall show that there exists a value of 7 such that the number of 
components of M—M-(Dj+D,) which intersect both Bf 
and 8; is finite. 

Suppose there are infinitely many such components for each 
value of 7. Consider for each 7 an infinite sequence Cy, C#,--- , 
of components of M—M-(D+D,), no two of which lie in the 
same component of M—M-(Dj+D,), and such that both 


Bi and Bg contain a limit point of C/(j=1, 2,---). There 
exists an infinite subsequence a‘ = - - - of the sequence 
C2’, ---, and two components jg, ik of Bx, respec- 


tively, such that each element of a‘ intersects both je and jx. 
No component of Bi or Bg distinct from ju and jx can inter- 
sect more than two elements of a‘. For suppose j is such a com- 
ponent, intersecting, say, the three elements E/, E#, E; of 
a‘. The continua are mutually exclusive, intersect 
j, and are not disconnected by 7. It follows that no two points 
of jy: Ej, ae 2, 3), are separated from each other in jj 
by Efi . Hence there exist three arcs be, b3 such 
that (1) = and (2) b,, (n =1, 2, 3), contains 
but no point of —E,!. Denote by (n=1, 2, 3), the 
continuum £,'+b,+j. Let 6; and 6¢ denote the complemen- 
tary domains of jj and jj which do not contain jg and jj, re- 
spectively. The three continua F,' , (n=1, 2, 3), have a point in 
common and their sum separates a point X of 6} from a point Y 
of 5; in the plane. Hence, by a theorem of Kuratowski,{ there 
exist two of these continua, say Ff and F#, such that Ff + Ff 
separates X from Y in the plane. But 6j+(E;' —j-E;')+6; 
is a connected set which contains X and Y but no point of 
Fi 
Consequently, the number of components of 6g+ 8x being 
finite, there exists an integer r such that each element of the 
sequence yi=E}, intersects both components jg, 
jx without meeting any other component of Bi+Bs. The 
intersection of each element E,' of y‘ with jg is contained in an 
arc (or point) of jg which intersects no other element of yi. 
Denote by a,’ the minimal such arc (or point), and by b,' a 


Loc. cit. 
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similar arc (or point) containing the intersection of E,' with 
jx. The sequence +b, (n=r,r+1,--- ), contains a sub- 
sequence which has a sequential limiting set; let P; and Q; de- 
note points of this limiting set which lie on jf and j¢ respec- 
tively. The sequence {P;+Q;} contains a subsequence which 
has a sequential limiting set; let P and Q denote points of this 
limiting set which lie in 7 and K, respectively. 

By hypothesis there exists a finite collection pg of continua 
in M such that pg separates P from Q in M. Hence M—T'pg 
= Np+ No, where Np and Ng are mutually separated sets con- 
taining P and Q, respectively. Let Rp and Rg be regions contain- 
ing P and Q, respectively, such that Rp- (No+T'pe) =Rg-(Np 
+T po) =0. Let i be chosen so that P; and Q; lie in Rp and Ro, 
respectively. Then there exists an infinite subsequence e'=Ei,, 

of such that and (t=1, 2,---), are sub- 
sets of Rp and Rg, respectively. Every element of e‘ intersects 
some element of I'pg. Therefore some element W of I'pg inter- 
sects at least two elements e€’, hence contains a point of BitBe. 
Hence, if E’,, denotes an element of €' which intersects W, then 
W contains a point of a‘, But ai, +i, is a subset of 
Rp+Reg, which contains no point of T'pg. 

It follows that there exists a value of 7 such that the collec- 
tion T of components of M—M-(Dj+D,) which intersect 
both 8; and 8 is finite. Denote by Uy the sum of those com- 
ponents of M—M-(Dj+D,) which intersect 8; but not B, 
and by Ux the sum of those components which intersect Bg 
but not 84. It can be seen that M-Dj+Uy and M-Di+Ux 
are mutually separated sets containing H and K, respectively. 
Hence I’* separates H from K in M. 

R. L. Moore has proved{ the closely related result whose 
statement is identical with that of Theorem 1 except that each 
I',, and I are taken as finite sets. The argument is valid in any 
space satisfying his axioms 0 and 1. An example showing that 
Theorem 1 is not true for three-dimensional continua has been 
given by W. T. Reid.f{ 

The conclusion of Theorem 1 will also hold if I'4, merely 


+ Foundations of Point Set Theory, Colloquium Publications of this Society, 
vol. 13, p. 140, Theorem 57. 
t See this issue of this Bulletin, pp. 683-684. 
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weakly disconnectst A from B in M, provided M is locally con- 
nected at each point of H+K. The argument is for the most 
part similar. 

Again, if we require only that '4z be a countable collection 
of continua in M such that fg is closed and separates A from 
B in M, then there exists a countable collection I of continua in 
M such that I* is closed and separates H from K in M. 


THEOREM 2. Let M be any subcontinuum of a plane or a sphere 
and let Z be any subset (possibly vacuous) of M. If G is a finite 
collection of connected subsets of M such that (1) there exist r 
points (r finite) whose sum L intersects each element of G, and (2) 
G*+Z weakly disconnects a point A from a point B in M, then 
there exists a subcollection H of G, containing not more than 2r 
elements, such that H*+Z weakly disconnects A from B in M. 


The case where M is a subset of a plane is a consequence of 
the case where M is a subset of a sphere. Suppose, then, that 
is a subcontinuum of a sphere S. We shall discuss in order the 
three cases that may arise. 

CasE 1. Suppose r= 1. Assume the theorem false. Let gy, - - - , 
gn denote the elements of G and let O be a point common to these 
elements. There exist continua C;;, (4, 7=1,--- ,), such that 
C;; contains A +B and has no point in common with g;+g;+Z. 
Let Di, (k=1,---,), be a connected domain which contains 
g, and contains no point of any C,;, (j=1,---, ). Let Do 
be a domain which contains Z but contains no point of any 
(i, 7=1, -- +, Let The domain (S—_M)+A 
weakly disconnects, hence separates, A from B in S. Hence 
there exists a component Q of (S—M)+A which, separates A 
from B in S, and Q must contain the connected set ah. 

If Q:;, (¢=1,---, 2), denotes that component of (S—/) 
+Do+D; which contains g; then Q=Qi+ --- +@Q,. For 
suppose the contrary and let R denote the set Q—(Qi+ --- 
+Q,). Since Qi+R is not connected, R contains a subset R; 
such that R; and (Qi:+R)—R;, are mutually separated sets. In 
general, if R; has been defined, Ri1, (#=1,---, m—1), will 


7 A set L weakly disconnects a set A from a set B in a connected set M 
if it contains a point of every connected and relatively closed subset of M 
which intersects both A and B. 
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denote a subset of R; such that and (Qi4:1+-R:) — Riz: are 
mutually separated sets. It follows that R, and Q—R, are mu- 
tually separated sets, contrary to the connectedness of Q. 

Let Fu, (¢=1,---,m;t=1, 2,---), be a subcontinuum of 
Q; which contains O and every point of Q; whose distance from 
the boundary of Q; is greater than 1/t. Let F;= + Fae, 
(t=1,2,--- ). There exists a value w of ¢ such that F,, separates 
A from B in S. For on the contrary supposition there exists a 
sequence of continua Ni, Ne, - - - , such that N;, (¢=1,2,---), 
contains A+B and contains no point of F;. The limiting set of 
this sequence is a continuum which contains A +B but no point 
of Qit+ --- +Q,. But this is impossible. Consequentlyf there 
exist two of the continua F;,, say Fi. and F2., whose sum sepa- 
rates A from B in S. Hence Q:+(Q2 separates A from B in S. 
But the continuum Cy, contains A+B and has no point in 
common with Q,+(Q2. 

CasE 2. Suppose that r>1 and G* is connected. There exists 
a subcollection K of G having not more than 27 —2 elements and 
such that K* is a connected set which contains L. Denote by 
G’ the collection of all elements each of which is the sum of K* 
and an element of G. The elements of G’ are connected sets hav- 
ing K* in common; hence, by Case 1, there exist two elements 
gi and g? of G’ such that gi +g?¢+2Z weakly disconnects A 
from B in M. The set gi +g¢ is the sum of 27 or less elements of 
G. 

CasE 3. Suppose merely that r>1. Let HW be a subcollection 
of G which is irreducible with respect to the property that 
H*+Z weakly disconnects A from B in M. Let G,---, Cy 
denote the components of H* and let 7;, (¢=1, - - - , g), be the 
number of points common to L and C;. By Case 2 there exist 
2r; or less elements of H lying in C;, (¢=1, - - - , g), such that, 
if 7; denotes their sum, 7;+ [Z+(H*—C;) ] weakly disconnects 
A from B in M. Hence C; cannot contain more than 27; elements 
of H. Therefore, since r=7,+ - - - +r,, H* cannot contain more 
than 27 elements of G. 


Tue UNIVERSITY OF TEXAS 


+ C. Kuratowski, loc. cit. 
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A NEW UNIVERSAL WARING THEOREM 
FOR EIGHTH POWERS 


BY ALVIN SUGAR 


1. Introduction. Hardy and Littlewood* in their proof of 
Waring’s theorem obtained a constant C= C(s, k) beyond which 
every number is a sum of s integral kth powers 20. Recently 
Dickson perfected an algebraic method by which he was able to 
show that every positive integer <C is a sum of s integral kth 
powers 20. Thus we are now able to obtain universal Waring 
theorems for relatively small values of s. 

We shall consider in this paper the problem of meeting the 
Hardy and Littlewood constant by Dickson’s method and estab- 
lishing a new universal Waring theorem for eighth powers. The 
earlier result for eighth powers was 575, obtained by Dickson.{ 

2. Proof of the Principal Theorem. We write 


(1) “a= 2°, b = 38, c= 48, d= §§, e= 68. 


The right side of 

m=n-+Aa+ Bh+---+Qq, (mn, A, B,--- ,Q integral), 
is a resolution of m of weight w(m) =n +A+B+ ---+Q. When 
n, A, B,---,Q20 the resolution is a decomposition. 

By division we obtain 
(2) b = 161 + 25a, c = — 74+ 100, d = 56 + 15a + 9b + Sc, 
(3) e= 21+ 224+ 75+ c+ 4d. 


Consider an integer M, such that 2d+e< M<3d-+e. We can 
express the integer P = M —2d—e uniquely in the form R+N, 
where 


(4) 0<R<a=256, N=Aa+Bb+Cc, 
(5) C = [P/c],t B = [(P — Cc)/b], A = [(P — Bb — Cc)/a]. 


* A simplified proof can be found in Landau, Vorlesungen iiber Zahlentheorie, 
vol. 1, 1927, pp. 235-360. 

+ This Bulletin, vol. 39 (1933), p. 713. 

t [x] denotes the largest integer <x. 


= 
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Since N <d, we obtain by (4) and (5) the inequalities Cc <d, 
Bb<c, and Aa<b. Hence 


(6) OSA < 26, 0=B< 10, <6. 
Since 
(7) M =R+Aa+t+ Bb+Cc+ 2+ e, 
then 
w(M)=R+A+B4+C+3 


8 

(8) S 255+25+9+5+3 = 297. 

Since (7) defines a decomposition of M, we can state the follow- 
ing lemma. 


LeMMA 1. Every integer M, such that 2d+eS M <3d+e, is a 
sum of 297 eighth powers. 


Let us now consider the problem of obtaining a smaller value 
for w(M). Table I contains a list of certain equations of the form 


(9) r=A’a+ Bb+Cc+Dd+ Er. 


Such an equation defines a resolution of r of weight w. We shall 
refer to an equation of Table I by citing its r value whenever we 
may do so without ambiguity. For example, equation 31, 
31 = —10a+6c—d, which defines a resolution of 31 of weight 
—5, is the first equation listed in Table I. We can readily verify 
these equations by (1). 

We write (7) in the form M,=A,a+B,b+Cic+2d+e+r+r’, 
A, Bi, fixed and R=r+r’, where 0<1r’ <1r,—r (the subscript 
f is used to denote that 7; is the equation immediately following 
r in Table I and possessing the property r;>r). Eliminating r 
between this equation and (9), we obtain 


M, = (Ay + A’)a + (By + B’)b a (Cy + C’)e 
+(2+D)d+(1+ 
We construct a tablette A =A, B=B,, C=C, by listing the r 
and w values of resolutions of Table I whose coefficients satisfy 
the inequalities A; +A’=0, B, +B’ =0, C,+C’=0 (it should be 


noted that for such resolutions, (10) gives decompositions of M, 
since all the resolutions in Table I satisfy the inequalities 


(10) 
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TABLE I* 
List OF EQUATIONS 
b c d e b 
31 -5 —10 0 6 -1 9 -3 8 1 4 —1 
2 2 148 18 0 20 -2 0 0 
62 —10 —20 2 0 |} 157 15 6 -1 12 -2 0 
66 O -3 -—2 2 4 -11]/ 158 46 12 27 5 3 -1 
70 48 31 8 i1 -2 0 || 169 34 26 9 -1 0 0 
72 18 9 1 171 4 —13 8 7 3 -1 
81 58 9 2 189 41 2 2-1 
17 6 3 192 20 13 8 
95 25 26 -1 0 0 0 || 200 29 16 9 5 -1 0 
97 -5 -13 -—2 8 3 -1/]]/ 210 8 —20 2 10 -2 0 
105 4-10 10 5 -1 0 jj 214 18 -—3 18 0 4 -1 
105 53 38 6 7 3 —1 |} 223 15 3 -—3 14 2 -1 
195 2 —1 || 231 24 6 9 11 -—2 0 
126 20 16 —1 6 -1 0 || 245 13 -13 18 6 3 -1 


2+D’=0,1+£’=0). From the r and w columns we form a new 
column, the W column, where W=(r;—1)—r+w. We obtain 
our tablette in its final form by deleting all equations r; for 
which w,>W. We denote the greatest W value in this tablette 
by G(A,, Bi, Ci). For example, upon constructing the tablette 
A=0, B=0, C=0 we get 


A=0, B=0, C=0 A=0 B=0 C 


=( 
r w W r w W 
0 0 40 115 32 74 
41 23 cu 158 46 76 
70 48 58 189 41 51 
81 58 60 200 29 59 
84 37 ST 231 24 48 
105 53 62 G=76 


By constructing eleven such tablettes, we obtain 

G(0, 0,0) = 76, G(3, 0,0) = 62,  G(15, 4,0) = 41, 
G(0, 1,0) = 60, G(5, 4,0) = 47, G(20, 0, 0) =39, 
G(0, 0,1) = 74, G(14,0,0) = 50, G(10, 0, 0) =57. 
G(0, 0, 2) = 64, G(20, 4,0) = 37, 


(11) 


* The method of obtaining these equations is analogous to that explained in 
Dickson’s paper on ninth powers in this Bulletin, vol. 40 (1934), pp. 487-493. 
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Because of the exclusive condition w;>W, the tablettes have 
the property (P), G(A2, Bz, C2) B,, Ci) if A,S Az, 
B, <= < C2. 

Let us now consider M,=R+A,a+B,b+Cic+2d+e. Since 
w(R) SG(A,, Bi, Ci), 


w(M,) G(A,, B,, C;). 


Dropping the subscripts we seek the maximum value H of the 
function A+B+C+3+G(A, B, C) in the range (6). From (11) 
and (P) it is evident that H 81. This result may be expressed 
as follows. 


LemMa 2. Every integer M, such that 2,460,866 < M <2,851,491, 
is a sum of 81 eighth powers. 


From this interval we ascend to the Hardy and Littlewood 
constant by employing two theorems of Dickson, Theorems 10 
and 12 in this Bulletin.* Using Theorem 10 and a table of 
eighth powers it was found that 102 eighth powers suffice for 
the enlarged interval from g=2d+e to Lo =2,235,617-10°. Ap 
plying Theorem 12, we obtain 


log L; = (8/7)*(log Lo + h) — h, h = — 8 log 8, 
h = 8.775280, log (log Lo + h) = 0.909806, log (8/7) = 0.057992. 


We take ¢=464; then loglog L;=27.818. Hence 566 eighth 
powers suffice from q to Ly. 

James showed that log, C=20-82", where C=C(s) is the 
Hardy and Littlewood constant, s is the number of eighth 
powers, and 


For s=566 we obtain log log C=27.762. Hence every integer 
>q is a sum of 566 eighth powers. It is evident from (8) that 
294 eighth powers suffice from 0 to d. Consequently 300 eighth 
powers suffice from 0 to 7d. Since q<7d, we have the following 
theorem. 


THEOREM. Every positive integer is a sum of 566 integral 
eighth powers. 


THE UNIVERSITY OF CHICAGO 


* Loc. cit., vol. 39 (1933), pp. 710-711. 
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SOME THEOREMS ON TENSOR DIFFERENTIAL 
INVARIANTS 


BY JACK LEVINE 


1. Introduction. In the theory of algebraic invariants there is 
a theorem which states that if an absolute invariant be written 
as the quotient of two relatively prime polynomials, then the 
numerator and denominator are relative invariants.* If we con- 
sider absolute scalar differential invariants of a metric (or affine) 
space, then it is possible to prove a similar theorem regarding 
them. In the course of the proof we give a new proof of the fact 
that in a relation of the form (2) the @ must be a power of 
the Jacobian of the coordinate transformation. (In the alge- 
braic theory the ui are of course constants.) This proof involves 
the use of the differential equations satisfied by the scalar.¢ In 
this proof it is not necessary to restrict B and ¢ to be poly- 
nomials in their arguments as is done in the usual proof of the 
corresponding theorem in the invariant theory. It is sufficient 
to assume that @ possesses first derivatives with respect to the 
ui and that B(g) is an analytic function of € in the neighbor- 
hood of «=0. We also extend the theorem to the case of tensor 
differential invariants of the form (5). 

2. Scalar Differential Invariants. We consider the differential 
invariants of a metric space V, with a quadratic form g;j;dx‘dx’. 


Let 


be an absolute scalar invariant of V, which we take to be ra- 
tional in its arguments. We can then write A in terms of the 
gi; and their extensions g;;,,...1, and we have 


03 ) 


A(gij; 03 ) 


* See, for example, H. W. Turnbull, The Theory of Determinants, Matrices, 
and Invariants, p. 277. 

¢ T. Y. Thomasand A. D. Michal, Differential invariants of relative quadratic 
differential forms, Annals of Mathematics, vol. 28 (1927), p. 679. 
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where B and C are polynomials in the g;; and their extensions. 
We may assume that B and C have no common factor. Now 


 B@iss 
= 
under an arbitrary coordinate transformation x—#, the barred 


g’s being the g’s in the (#) coordinate system. It is easily shown 
we must have* 


(2) ) = ), 
where u; =0x*/0z%’ and ¢ is a polynomial in the u’s. We now 
prove ¢ is a power of | x/d%|, thus showing B and C are relative 


scalars. 
Write (2) in the form 


= B(g), 
and consider the infinitesimal transformation 


xt = £'+ 


(1) 


We have 
( = ( dg ) 


As ¢ is a polynomial, so also is 6¢/du}, and on evaluating this 
last expression at €=0, we obtain a set of constants k/, so that 


(=) 
de Ox! 


Proceeding then as in the paper by Thomas and Michal, page 
679, we obtain the differential equations satisfied by B in the 
form 


(4) X.(p)B = RB. 


Now for any function G we havet 


* H. W. Turnbull, loc. cit. 
+t Thomas and Michal, loc. cit., p. 663. 
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and, in particular, for B we would have 

(Xt, Xn)B = — 6,XnB = — dskm). 
But also 

(Xi, Xm)B = Xi(kmB) — = B(kmks — = 0, 
so that 

— = 0, 

from which follows ki, =k6,, where k is a constant. Substituting 
in (4) we find that B is a relative scalar of weight k and* 
o= | dx/az|*. Similar results hold for C also. As stated in §1 we 


can prove a generalization of this result which we state as fol- 
lows. 


THEOREM 1. Given a function 


B( 
axt]’ 


with the law of transformation 


0°83; ) ( 0" ) 


where possesses first derivatives in the u’s and B(Z) is analytic 
in the neighborhood of «=0. Then ¢ is a power of the Jacobian and 
B is a relative scalar differential invariant. 


3. Tensor Differential Invariants. Consider the absolute ten- 

sor differential invariant with components of the form 
U;...5 kl,mq3 °° -) 
D(git;--- ) 

where the U’s and D are polynomials (with no common factor) 
in their arguments. Corresponding to (1) in the scalar case we 
have 


Fy 
= 


(6) 


* Thomas and Michal, loc. cit. 


= 
= 
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(P, Q having no common factor), where the primed U’s and D 
represent the result of replacing the barred g’s by their values 
in terms of the g’s and u’s in the expressions U-.-(g) and D(j). 


From (6) we see that Q is a factor of D and of Ut. pub +++ uh, 
and hence must be a function Q’(g), so that 

(8) D(g) = Q’(g)R(g). 

In (7) put uj = 5}; then 

(9) lg) = 8). 


Hence Q’(g) =const., since D and the U’s have no common fac- 
tor. Since D(g) and D’(g, u) are of the same degree in the g’s, 
it then follows from (6) that P(g, u) =P’(u), so that 

D(g) = $(u{)D(g). 


Hence as proved in the previous section for B, we have shown 
that D is a relative scalar of weight k, and therefore U3:::5 are the 
components of a relative tensor of weight k. We can also prove 
the following theorem. 


THEOREM 2. If the set of quantities 


OP 
Ox™ 


have the transformation law 


7 d a b 


then $ is a power of | ax/dz| and the T’s are components of a rela- 
live tensor invariant, tt being assumed that > possesses first deriva- 
tives in the u’s, and T(2) are analytic in the neighborhood of «=0. 


The proof is similar to that used for B of the previous sec- 
tion. Similar results to those obtained for metric scalar and ten- 
sor differential invariants hold for affine invariants. 


PRINCETON UNIVERSITY 
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A CERTAIN THREE-DIMENSIONAL CONTINUUM* 
BY W. T. REID 


In this note there is given an example of a bounded con- 
tinuum in three-dimensional euclidean space such that there 
exists a point A of M and subcontinua G, K;, and K2of M—A 
satisfying the following conditions: (1) M—K;,, (i=1, 2), is the 
sum of two mutually separated point sets each of which is con- 
nected ; (2) each point of G is separated from A in M by either 
K, or Kz; however, (3) there does not exist a point set consisting 
of a finite number of connected subsets of M and separating 
A from Gin M. 

This example was obtained in 1928 while the author was a 
student at the University of Texas. It answers in the negative 
for three-dimensional continua a certain question concerning 
continua proposed to the author by R. L. Moore at that time. 
R. E. Basyef has recently answered Moore’s question in the 
affirmative for plane continua. It is because of Basye’s result 
that the example of this note is now published. 

EXAMPLE. Using rectangular coordinates, consider the points 
pi=(1/2, 0, 0), po=(0, 0, 0), 0, 1), tfo=(0, 0, 1), 
s;=(1/2, —1,0), 59=(0, —1,0),and A =(0, 1,0), (¢=1,2,-- -). 
If P and Q are two points we shall denote by PQ the closed 
straight line segment from P to Q. The continuum M is now 
defined as [A pat where it is 
understood that the summation is with respect to m over the 
values n=0, 1, 2, - - -. The subcontinua G, K;, and Ke are de- 
fined as follows: G = +Sotot tot, Ki Patan ttohr, => PaSn 
It is seen that M—K,=U,+ V, where 
+ [soto —to] +5051 and [A Pn—Pn|. Similarly, 
+V, where [soto—so]+totr. Clearly U; and 
V, (¢=1, 2), are mutually separated point sets each of which 
is connected. Moreover, each point of G belongs to either U; 
or Us. However, where B,=),[Ap,—A ] 
[Pnsn—Sn]. Since (n,m=0,1, - - -), 


* Presented to the Society, February 23, 1935. 
t See this issue of this Bulletin, pp. 670-674. 
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the connected point sets B, and B,, have no point in common, 
there clearly exists no point set consisting of a finite number of 
connected subsets of M and separating G from A in M. 


THE UNIVERSITY OF CHICAGO 


A THEOREM ON PLANE CONTINUA* 
BY W. T. REID 
1. Introduction. In this paper the following theorem is proved. 


THEOREM. If M is a plane continuum, and K is a proper sub- 
continuum of M, then at least one component of M—K has a limit 
point in K. 


Two points sets are mutually separated if they are mutually 
exclusive and neither of them contains a limit point of the other. 
A point set is said to be connected if it is not the sum of two 
non-vacuous mutually separated point sets. A point set which 
is both connected and closed is a continuum. A component of 
a point set N is a connected subset of N which is not a proper 
subset of any other connected subset of N. The set of all points 
in the plane will be denoted by S.f 


2. Proof of the Theorem. If Misa bounded continuum and K is 
a proper subcontinuum of M, it is well known that every com- 
ponent of M—K has a limit point in K.{ If M is unbounded 
then it is no longer true that every component of M—K hasa 
limit point in K.§ 

If K is a bounded subcontinuum of an unbounded plane con- 
tinuum M, then the above theorem may be proved readily. For 


* Presented to the Society, December 28, 1934. The result of this paper 
was obtained in 1928, while the author was a student under R. L. Moore at 
the University of Texas. Recently both R. L. Moore and J. H. Roberts have 
proved results beyond that of the present paper and have suggested that I 
publish my original result. 

| These definitions are those customarily used in point set theory. See, for 
example, R. L. Moore, Foundations of Point Set Theory, Colloquium Publica- 
tions of this Society, vol. 13. For brevity, this treatise will be referred to as 
“Moore.” 

t See, for example, Moore, p. 24. 

§ See Moore, p. 25, example 2. 
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let J be a simple closed curve enclosing K. If D is the interior 
of J and M, denotes the component containing K of the subset 
of M belonging to D+-J, then M, is a bounded continuum con- 
taining K as a proper subcontinuum. As indicated above, if C, 
is acomponent of 14,—K, then C, has a limit point in K. Hence 
the component C of I—K which contains C, has a limit point 
in K, and the theorem is proved. 

Now suppose that K is an unbounded proper subcontinuum 
of the plane continuum M. If x is a point of M—K, the com- 
ponent of M—K containing x will be denoted by C-,. On the as- 
sumption that the above theorem is false, it is seen that each 
component C, is an unbounded continuum. It follows, there- 
fore, that M is the sum of a set of mutually exclusive unbounded 
continua consisting of K and the totality of components of 
M-—K. This set of mutually exclusive continua is necessarily 
uncountable in number.* 

We shall prove the preceding theorem by showing that the 
assumption that it is false leads to a contradiction. The proof 
depends upon the following auxiliary lemma which will be es- 
tablished in the next section. 


LemMaA 1. On the assumption that the above theorem is false, the 
components C, of M—K satisfy the following conditions: (a) for 
each component C,, M—C, is connected; (8) if x; and x2 are two 
points of M—K such that C,, and C,, are mutually exclusive, 
then there does not exist a simple continuous arc x\X2 such that 
X1X2— (x1 belongs to S—M. 


With the help of this lemma the proof is as follows. Let D 
be a complementary domain of K containing points of M—K. 
Since M is a continuum, it follows that D must contain infinitely 
many of the components C, of M—K. Let x; and x2 be points of 
M-—K belonging to D such that C,, and C,, are mutually ex- 
clusive, and consider a simple continuous arc x1x2 which belongs 
to D.¢ Let X denote the subset of points x of M—K such that 


* See R. L. Moore, Concerning the sum of a countable number of mutually 
exclusive continua in the plane, Fundamenta Mathematicae, vol. 6 (1924), pp. 
189-202. 

{ For a proof that such an arc exists, see R. L. Moore, On the foundations 
of plane analysis situs, Transactions of this Society, vol. 17 (1916), pp. 131-164, 
in particular, p. 137; see also, Moore, p. 86. 
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the component C, has a point in common with the arc x;%2. 
Now if X were connected, it would follow that C,, and C,, be- 
long to a single component of M—K, which is a contradiction. 
Hence X is the sum of two mutually separated point sets X, 
and X>2. Clearly, if x is a point of M—K belonging to a set X;, 
(i=1, 2), then every point of C, belongs to the same X;. Let 
Y;, («=1, 2), denote the set of points common to X; and the 
arc xx. The sets Y; are seen to be closed and mutually exclu- 
sive. There exists, therefore, a subarc x19X%20 of x1x2 such that 
belongs to Y; and x:o%20— (x10 +20) belongs to and 
therefore to S— M. This, however, is a contradiction to conclu- 
sion (8) of Lemma 1, since x19 and x29 belong to mutually ex- 
clusive components of M— K. We have proved that the assump- 
tion that our theorem is false leads to a contradiction. Hence the 
theorem is true. 


3. Proof of Lemma 1. Conclusion (a) of Lemma 1 has been 
established by Knaster and Kuratowski.* We shall prove con- 
clusion (8) by showing that the assumption that it is false leads 
to a contradiction. For suppose that there are two points x; 
and x2 of M—K such that C,, and C,, are mutually exclusive and 
there exists a simple continuous arc such that (x1 +42) 
belongs to S— M. Let D denote the complementary domain of 
the continuum K which contains C,,+C,,+:%2, and denote by 
M, the subset of M belonging to D. Since M is a continuum, 
M, is seen to contain infinitely many of the components C, of 
M-—K. Now let x’ be a point of @,—(C.,+C,,), and P a point 
of the arc x:x2 between x; and x2. Since x’ and P are both points 
of D, they are not separated by the continuum K. Moreover, 
since belongs to S—M and M—C,,, (4=1, 2), is 
connected in view of conclusion (a), there exists a complemen- 
tary domain D; of C,, containing the points P and x’, and hence 
the continuum C,,, (¢=1, 2), does not separate x’ and P. It 
follows, therefore, that K+(C,,+(C., does not separatet x’ and 
P, and hence there exists a simple continuous arc x’P belong- 


* See B. Knaster and C. Kuratowski, Sur les ensembles connexes, Funda- 
menta Mathematicae, vol. 2 (1921), pp. 206-255, in particular, p. 214, Theorem 
X. 

+ See B. Knaster and C. Kuratowski, Sur les continus non-bornés, Funda- 
menta Mathematicae, vol. 5 (1924), pp. 23-58, in particular, p. 35. 


| 
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ing to S—(K+C,,+C,,). Let O denote the first point of the arc 
x’P belonging to the arc x:x2, and x; the last point of the arc 
x’O belonging to M. The point O is seen to be between x; and x2 
on the arc x;x2, and x3 is a point of M—K such that C,, is dis- 
tinct from C,, and C,,. 

Now let H;, (j=1, 2, 3), denote the continuum consisting of 
the component C,, and the arc Ox;. Each of the three unbounded 
continua H; contains the point O, no two of them have in com- 
mon any point except O, and no one of them is separated by the 
omission of O. It then follows* that there is no complementary 
domain of the continuum H = H,+H2+ H; whose boundary con- 
tains a point, distinct from O, of each of the three continua Hj. 
IfDdenotesthe complementary domain of H containing the con- 
tinuum K, the boundary of D, aside from the point O, belongs 
to some two of the continua H;. For definiteness, suppose that 
the boundary of D is a subset of Hi +H». Let Dp denote the com- 
plementary domain of the continuum H,+//2 containing the 
continuum C,,, and denote by M, the subset of M belonging to 
Do. Since D is a complementary domain of H, the domain Do 
is distinct from the domain D and the point set My does not 
contain K. Moreover, since M is a continuum, infinitely many 
of the components C, belong to Mo. 

Now consider the closed point set N= Mo+C.,+C.,. If N is 
connected, we have a contradiction to the assumption that C-,,, 
(j=1, 2, 3), is a component of M—K. Suppose, on the other 
hand, that JN is not connected. It will now be shown that under 
this hypothesis the point set N is the sum of two mutually ex- 
clusive closed point sets N; and N2 such that N, contains both 
C,, and C,,. Since N is closed, the assumption that N is not con- 
nected implies that N= Nio+ Noo, where Nio and Neo are mu- 
tually exclusive closed point sets. It may be supposed without 


* For let A be a point of S—H and subject the plane to an inversion about 
A. If O, H;, (j=1, 2, 3), denote the images of O, H;, (j=1, 2, 3), under this 
inversion, then the bounded continua L;=H;+A satisfy the following condi- 
tions: each of the continua L; contains the distinct points A and O, no two of 
them have in common any point except A and O, and no one of them is sepa- 
rated by the omission of A +0. It then follows [see, for example, Moore, p. 291 | 
that there is no complementary domain of the continuum L=1,+L1.+L; 
whose boundary contains a point, distinct from A and O, of each of the three 
continua L;, (j=1, 2, 3). Since each complementary domain of L is the image of 
a complementary domain of H, the proof of the above stated result is complete. 
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loss of generality that Nio contains more than one of the com- 
ponents C, of M—K which belongs to N. If Nip contains both 
C,, and C,,, the property stated above is true for Ni= Mio, 
N2= Neo; if Nio contains neither C,, nor C,,, the property is 
true for Ni=Noo, Ne=Nio. Finally, suppose that (i, 7) is a 
permutation of (1, 2) such that Nip contains C,, and Neo con- 
tains C,;. Since Nip contains components of M—K distinct 
from C,,, the closed point set Nio is not connected. Therefore, 
Nio= NioitNioz, where and are mutually exclusive 
closed point sets and Ni: contains C,;. The above stated prop- 
erty is then true for Ni = Nini + Noo, Ne= Nie. Now since N, 
contains both C,, and C,,, we see that Nz is a subset of Mo 
and hence contains no limit point of —N. Consequently, N» 
and i —N-2 are mutually separated, which is a contradiction to 
the assumption that M is a continuum. 

On the assumption that conclusion (8) of Lemma 1 is false 
we have thus been led to a contradiction. Hence this conclusion 
is true, and Lemma 1 is established. 


4. Remarks. In a letter to me, J. H. Roberts has stated that 
he has discovered an example of a continuum in three-dimen- 
sional euclidean space showing that the result of the theorem 
of this paper does not hold when the condition that the con- 
tinuum M be a plane continuum is omitted. There still remains 
the interesting question as to whether or not the statement ob- 
tained by replacing in our theorem the word “component” by 
“maximal strongly connected subset”* is true. That this latter 
question may also be answered in the negative when the condi- 
tion that the continuum M be a plane continuum is omitted is 
a consequence of an example given by Knaster and Kuratowski 
of a three-dimensional indecomposable continuum each of whose 
composants is a continuum. f 


THE UNIVERSITY OF CHICAGO 


* A point set N is said to be strongly connected if for every two points x 
and y of N there exists a continuum which contains both x and y and which 
is a subset of N. A maximal strongly connected subset of N is a strongly con- 
nected subset of N which is not a proper subset of any other strongly connected 
subset of N. 

t See B. Knaster and C. Kuratowski, Sur les continus non-bornés, Funda- 
menta Mathematicae, vol. 5 (1924), pp. 23-58, §4. 
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THE CONSTANTS IN WARING’S PROBLEM 
FOR ODD POWERS 


BY R. D. JAMES 


1. Introduction. In a previous paper* the writer determined 
the constants in the Hardy-Littlewood analysis of Waring’s 
problem. It was then possible to obtain new universal theorems. 
The proof depended on the following results. 


THEOREM A. (Theorem 46, page 429, and (C), page 442 in 
paper T.) Let k= 6 be an integer; s an integer;a=1/k; 


d = [log (k — 1)/log 2]; D = (d + 2)(k — 1) — 24'4 011; 
k — 2) log2 — log k + log (k — 2 
log k — log (k — 1) 
D(s — 2) + k2*-*(1 + (1 — a)*-*) 
: 


(s— 2) — (k — 


Then every integer n>C, where log. C=20k*2", is a sum of s+Se 
kth powers. 


The proof of universal theorems was greatly simplified by 
Dickson’s new method. Using Theorem A and very short tables 
he proved thatf every integer is a sum of 259 seventh powers, 
575 eighth powers, 981 ninth powers, 10711 twelfth powers, - - - 
All these results are considerably better than those obtained 
by algebraic methods. 

In another paper{ the writer obtained new asymptotic re- 
sults for odd powers. It is the purpose of the present note to 
point out that the constants of this paper may also be evaluated 
by the methods of paper T. This leads to still better universal 


* Transactions of this Society, vol. 36 (1934), pp. 395-444. This will be re- 
ferred to as paper T. 

{ This Bulletin, vol. 39 (1933), Theorems 15-18, pp. 713-714. 

¢ Proceedings of the London Mathematical Society, (2), vol. 37 (1934), 
pp. 257-291. This will be referred to as paper P. 
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theorems for odd powers. Corresponding to Theorem A we have 
the following theorem. 


THEOREM B. Let k=7 be an odd integer; t=k2*~* an integer; 
a=1/k;b=1/(k—-1); 
d= [log (k—1)/log 2]; d= [log (k—2)/log 2]; 
where was defined in Theorem A; 
 (2k—1)(k—1) + 


v=4 when k 1s divisible by 15, v=2 in all other cases. Then every 
integer n>C’, where log.C’=25k*2", is a sum of 2t+52+2v 
+2k—1 kth powers. 


The use of this theorem instead of Theorem A enabled Dick- 
son* to prove that every integer is a sum of 4425 eleventh 
powers. The Hardy-Littlewood result showed only that every 
sufficiently large integer was a sum of 4687 eleventh powers. 


2. Changes Required for Cio. In order to obtain Theorem B 
there is one change which must be made in paper P. The con- 
stant Cs9 which occurs in the proof of Theorem 3 (page 289) 
is evaluated in precisely the same way as the constant C in paper 
T, and we shall not repeat the discussion. However, in Lemma 
33 (page 290) the constant Cio depends on a prime number 
theory constant which has not been evaluated. Accordingly we 
replace Lemma 33 by the following lemma. 


LemMMA A. Define k, t,and vas in Theorem B. Let 
p=3 when k is prime to 6, 
p=5 when k is divisible by 3 and prime to 10, 
p=2 in all other cases. 
Then given any integer w>p‘**©” there exist 2v kth powers 
W, We, * , Wey and integers and W such that 


a=1 


* Journal of the London Mathematical Society, vol. 9 (1934), pp. 201- 
206. 
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where 
(2* — 4) <ws(2'- 2) | 
The integer W satisfies the inequalities (8-32), for the value 


N= p*t2+* on page 288 of paper P. For by (2-31), page 258, 
we have 


2t(P(M) + 1) + N*-! 2¢(P(BN) + 1) + NF 
= 2#{(1 + B)*+ (1 — B)* — 2}N*1+4 NE 
< — 4 NE < — 
since we have 2t <2?* = 
3. Evaluation of C’. We shall now prove Lemma A and then 
show that the constant C’ in Theorem B is equal to 


k/(k—1) 


(2) pp + (2¢ + 2*)(2* — 


Knowing log. Cog (evaluated by the methods of paper T) we 
can therefore find log, C’. The result is log.C’ < 252". 
To prove Lemma A, we require two additional lemmas. 


LEMMA B. Let m(21) and E be integers; P=0, Q=0, and 
P+(m—1)*SE<Q. Then there exist at least m consecutive in- 
tegers i320, (8@=1, 2,---, m), which satisfy the inequalities 


(3) P< E-— if < 
Proor. Let 5=(E—P)"/*. Then [6]=m—1, and we have 
P=E-#8 [5]* — m+ 1)* 

P + 6* — ([6] —m+1)* P+ k(6 — [5] + m— 

P+ = P+ km(E — 

P+ kmQt-v/E, 


IIA 


The integers [5]—m+1, [6]—m+2,---, [6] are all [0 and 
satisfy the inequalities (3). 


Lema C. Define v and p as before. Let e=2 and F be integers; 
P2=0,Q20 and 


(4) P+ FSC. 


Then there exist integers h2,- , hv», and W such that 


= 
= 
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F= > hi + pw, 
a=1 


where 


WS + RODE, 


ProorF. It is known* that there is a solution of the congruence 
F= = h& (mod p*) 
a=1 
for v and pas defined. Hence there is an integer W such that 
(5) + pW. 
a=] 


For a=1, 2,---,v—1 we choose fz so that OSha<p*. Then 
from (4), (5) we have P+p%*< ht <Q. In Lemma B 
with m= p*, E= one of the p* consecutive integers ig 
must be congruent to hk, modulo p*, and we take h, equal to 
this particular is. It follows from Lemma B that 


PsF- = hE SP + 
a=1 
and then from (5) that 
< W S + RODE, 
The proof of Lemma A now proceeds as follows: Let 
(6) Q(u) = (2¢+ etn) 4 
Since w> and we have 
> (2t + 4 = (2), 


Thus for every integer w> p+) there is an integer 4=2 such 
that 


(7) Qu) Sw<Qu+ 1). 


* See E. Landau, Vorlesungen iiber Zahlentheorie, vol. 1, 1927, Theorems 291 
and 301. 


— 
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Now 


O(u + 1) — (2¢ + 2h) 4 yp 
— < (2t + plato! 
Hence we have 


— < w — 


(8) 
Qe 4 < qy — 


< (2/k) pk 2) 


As in the proof of LemmaC, with F=w—2tp “+” e=(k+2)k, 
there is a solution of 


v 


a=1 
with O < w, < Hence 
a=1 
where from (8) we have 
Since 
this gives 
(2% — 4) 4 < < (2/k) 
Next, by Lemma C, with F=H, e=uyk, 


there exist integers he, , hk», and W such that 
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(10) H= + pw, 


a=1 
where 
(2* — 4) <ws(2'- 4) 
(11) + h(2/k) 


(2* — 4) < W < — 2)pletetw 


Hence from (9) and (10) we obtain 
a=1 a=1 


where W satisfies the inequalities (11). This proves Lemma A. 


It remains to show how the constant C’ of Theorem B is ob- 
tained. Proceeding as in §9 of paper P with s=2k—1, 5 =44+& 
= 5, but using Lemma A instead of Lemma 33, we prove that 
w is asum of 


(2t + se + 20 + 2k — 1) 


kth powers, provided that W> Co. We shall have W> Cop if u is 
chosen so that 
From (6), (7) this inequality holds if 
w > 4 + 24) (24 — 
which proves (2). 
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GENERAL SOLUTION OF THE PROBLEM OF 
ELASTOSTATICS OF AN n-DIMENSIONAL 
HOMOGENEOUS ISOTROPIC SOLID IN 
AN n-DIMENSIONAL SPACE 


BY H. M. WESTERGAARD 


1. Introduction. Dealing with the important case of a three- 
dimensional solid subject to constant body forces (such as 
gravity) B. Galerkin* expressed the stresses and the displace- 
ments in terms of three functions, governed by the fourth- 
order equation AAf=const., and mutually independent except 
through the boundary conditions. He has demonstrated the 
fruitfulness of his method in later papers. f 

It is profitable to interpret Galerkin’s three functions as com- 
ponents of a vector. Simplicity is gained and significance is 
added by doing this. It is proposed to call this vector the 
Galerkin vector. Its nature is such that only a slight amount 
of complexity is added in the general derivations by consider- 
ing an m-dimensional space. 


2. Notation. Let the following notation be used. 
11, 12, °° tp, vectors in directions 
perpendicular to one another; m¥p. 
+ toxe +7,x,=radius vector drawn from 
the origin to any point; the point is called point R. 
+ ink + +2,£, =displacement =increment of 
R. The point R moves to the position R+¢9; 9 is assumed small. 
P=i;,P; + + +i,P, =force. 
K=i,K,+ - - - +i,K,=body force which is distrib- 


* B. Galerkin, Contribution a la solution générale du probléme de la théorie 
de Il’ élasticité dans le cas de trois dimensions, Comptes Rendus, vol. 190 (1930), 
p. 1047; Contribution a l’investigation des tensions et des déformations d'un corps 
élastique isotrope (in Russian), Comptes Rendus de |’ Académie des Sciences de 
l'URSS, (1930), p. 353. 

+ Comptes Rendus, vol. 193 (1931), p. 568; vol. 194 (1932), p. 1440; vol. 
195 (1932), p. 858; and papers in Russian: Comptes Rendus de |’ Académie des 
Sciences de l’URSS, (1931), p. 273 and p. 281; Messenger of Mechanics and 
Applied Mathematics, Leningrad, vol. 1 (1931), p. 49; Transactions of the 
Scientific Research Institute of Hydrotechnics, vol. 10 (1933), p. 5. 
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uted through the solid, per unit of magnitude of the region con- 
sidered ; of the dimension force times distance ~; a function of R. 

S,=section defined by x,,=const. The section is called a 
front face if the part of the solid dealt with lies on the side of 
smaller values of x,,; and a back face if the part dealt with lies 
on the side of greater values of Xm. 

Sm = vector representing internal force or stress on a small 
region of the front face S,, at point R, per unit of magnitude 
of this region; of the dimension force times distance~"*}; 
and—s»=stress on the same region of the back face Sn. 

¢m=component of s,, in the direction 7,,; normal stress in 
the direction of 7. 

T mp =component of s,, in the direction of i,, (bm) ; shearing 
stress on the front face S,, in the direction 7). 

=0,+0.+ --- +o,=bulk stress. 

E=Young’s modulus of elasticity (equation (6)). 

G=modulus of elasticity in shear (equation (7)). 

u=Poisson’s ratio of lateral contraction (equation (6)). 

--- +71,X,=Galerkin vector. 


OX OXn 
aX aX, 
div F=—— 
OXe Xn 
3? 0? 
A=div V= + 
Oxe 0x2 


3. Equilibrium. A set of forces P applied at the points R is 
in equilibrium when the n conditions 


(1) > Pn = 0, 
and the n(m—1)/2 conditions 
(2) — Cr) = — Cu) 


are satisfied, each sum including all the force components in 
the single direction of either 7,, or ip, C, and C, being constants 
which are interpreted as coordinates in the directions of i, and 
i, of an arbitrary fixed point C. 

Consider an element of the solid bounded by back faces 
Si, So, --- , S, intersecting at the point R, and by front faces 


= 
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S;, So,- +--+, S, intersecting at the point R+dR. With all the 
forces included which act on this element, and with C at the 
center of the element, equation (2) leads to the n(n — 1) /2 condi- 
tions 


(3) = 
Equation (1) leads to the ” conditions 
P 
+2 


Ohm 
excl. m 


(4) 


+ Kn = 0. 


Because of equation (3), equation (4) may be rewritten in the 
form 


(5) div 5m + Kn = 0. 


4. Elasticity. Hooke’s law of stresses and deformations is 
stated in the following form. The strain in the direction of 7,, is 
ity 


6 = m—-—9, 
(6) OXm E E 


and the detrusion in the directions of 7, and 7, is 


(7) 


ax, 


With the constants E and p given, only one value of G leads to 
isotropy. The value 


(8) 


E 
G = —— 
2(1 + 
represents isotropy in the cases n=2 and n=3, and is assigned 
to G here. That this value actually represents isotropy for any 


value of ” is concluded from the form of equation (13), which 
is derived from equations (5) to (8). Equation (6) gives 


1—(n-1 
(9) 
E 
which, with (8), permits the rewriting of (6) in the form 
Oxm 1—(n—1)p 


= 
— 
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Equations (7) and (10) lead to the following formula for the 
resultant stress on the front face S,,: 


OXm 1— (n—1)p 
Substitution from equation (11) in equation (5) gives 
1 — (n—1)p G 
or, in vector form, 
(13) Vdiv)o + —=0. 
1— (n—1)p G 
Isotropy is attained because equation (13) is independent of 


the orientation of the axes of coordinates. 


5. The Galerkin Vector. The general solution of equation (13) 
is the general solution of the problem. A difficulty arises from 
the interdependence of the components. This difficulty is over- 
come by introducing the Galerkin vector F. 

The displacement is expressed as 


1 
14 = — (cA — V div)F 
(14) 
in which ¢ is a constant yet to be selected. By substituting » 
from equation (14) in equation (13) and noting that 
(15) AV div = VdivA = V div V div, 


it is found that the terms containing the combined operators 
shown in equations (15) disappear when 


(16) 2(1 — (m — 2)u) 
1 — (n — 3)p 


Then equations (14) and (13) become 


1 2(1 — (m — 2)p) 
2GL 1 — (nm — 3)p 
1— (n — 3)p 


(18) = — 


— 
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The general solution of equation (18) defines the general solu- 
tion for g through equation (17). 


6. Stresses. The vector F and its components X1, X2,---,Xn 
lend themselves to expressions for the stresses. Equation (17) 
gives 


and 


1—(n—1)p 
(20) divo = A div F. 
2(1 — (n — 3)u)G 


Substitution from equations (19) and (20) in equations (10), 
(7), (9), and (11) leads to the formulas 


2(1 — (n — 2)p) 


Cn = 
1— (n— 3 
(21) ( \u 
+ Jaiv F, 
1 — (n — 3)p 
1—(n—2 OAX m 0? 
1 — (mn — 3)uL Ox, OXm OXmOXy 
1+ 
(23) @= A div F, 
1— (xn — 3)p 
and 
1— (n—2 0 
Sm = | vax + — aF| 
1 — (n — 3)p 
(24) 


+ v |aiv F. 
1— (x — OXm 
The form of equations (17) and (18) shows that F is inde- 
pendent of the orientation of the axes of coordinates. It follows 


that equations (19) to (24) continue to apply with the same F 
after a re-orientation of the axes of coordinates. 


UNIVERSITY OF ILLINOIS 
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SOME BESSEL FUNCTION EXPANSIONS 


BY W. H. WISE 


This is a formal derivation of some asymptotic Bessel func- 
tion expansions. An interesting feature of the derivation is that 
it makes use of Heaviside’s generalized exponential series.* The 
chief results arrived at, the Hankel function expansions (4) 
and (6), are generalizations of formulas (5) and (8) on page 141 
in Watson’s Theory of Bessel Functions in the same sense that 
Heaviside’s generalized exponential series is a generalization 
of the ordinary exponential series. 

In the course of some work on dipole radiation the author 
stumbled onto a particular case of the following relation 
(1) Hy — a)" ) = — 

—») 

In the particular case where v is 1/2, a is in the fourth quad- 
rant and 2 is in the first quadrant and has a large real part. Ex- 
cept for a missing common factor, the left-hand side in the 
particular case is half the Zenneck wave component of the wave 
function (as measured at the surface of the earth) for a 
vertical dipole centered on the interface between air and earth, 
the descending half of the series is the negative of the sky wave 
component of the wave function and the ascending part of the 
series is the convergent expansion for half the Zenneck wave 
component plus the sky wave component. 

The suggestion immediately offered itself that (1) should 
hold for all values of v, not just for »=1/2. 

If we assume for the moment that (1) holds for all v, and 
differentiate with respect to z, we get 


(1) 


Hy (2(1 — a)" (2) 
—1 (qz/2)"-" [2(n — v) 
= v\e 1 


* See John R. Carson, Transactions of this Society, vol. 31 (1929), pp. 782- 
792. 
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or 
(2) Hy (2(1—a)") = (1 — 


By use of the recurrence relation for Bessel functions, 
2p 
Zp+i(z) = — Z,-1(z), 


we get from (1) and (2) 


(az/2)"” 


2 
n=—o — v) 


n=0 (az/2)"-” a) 
= 


2 
vA 


(3) 


II (n v) 
(1) 


I(n — v — 1) 


n=00 (az/2)"-” 
(1 — a) 


Il 


and it is now more or less evident that in general 


n=—o Il(n — v) 


Equation (4) is one of a pair of similar formulas. If, in place 

of (1), we start out with the idea that 
(1) 1/2 — 
(5) Ho (ita) )= 
— v) 

then steps identical with those taken in going from (1) to (4) 
lead us to 
a) 


(6) =(1+a)" >> 
v) 


for integral values of m. 
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On page 180 in Watson’s Theory of Bessel Functions we find 
the relation 


(7) (z ) = - f ei cosh w— Pedy, 
—wo—iz/2 


good for all values of » and for | arg | <x/2. 
By setting (7) in the right-hand side of (6), we get 


n==0 (az/ v 
(a) (1+ a)"?>> dw, 


U(n—v) in 
az n—-v 
( 
n= 


1 
(b) (1 + a)™!2 fe cosh w+ m(w+ir/2) dw. 
ix — v) 


Now it is known that 


II(n—v) 


is a formal series representation for e*; it is Heaviside’s gener- 
alized exponential series. So the series in the integrand can 
be summed. The integral now is 


1 az 
(1 + a)™/2 wtm(wtirl2) exp evtixl2 \day 
in 


1 4 
(c) (1 + a)™/2 — fe cosh > — (—) 
in 


n=0 n! 2 


n=0 


The right-hand side of (4) may be summed in the same way. 

The series in (b) is a good representation for exp ((az/2)e’t '*/*) 
over only a part of the integration range, but, since the series 
can be recognized where it is valid, its sum may be used 
where the series is not valid. Because the series in (b) is a good 
representation for exp ((az/2)e’+‘*/?) over only a part of the 
integration range the integral (b) is meaningless except as it is 
interpreted as (a) or (c). This formal summation of the right- 
hand sides of (4) and (6) is therefore not entirely satisfactory. 
The following argument will be more convincing and inciden- 
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tally indicates that by the application of sufficient ingenuity 
the steps from (a) to (b) and (b) to (c) could be justified. 

Since the right-hand side of (8) is a solution of Bessel’s equa- 
tion it must be possible to determine constants A and B such 
that 


AH. (2(1 — + BJa(2(1 — a)””) 


(8) m2 (a2/2)"" 
= (1 — a) ery 


The right-hand side of (8) and the Bessel functions on the left 
satisfy the partial differential equation 


It follows that A and B are independent of a. 

To see that A =1 and B=0, let z= and a—0 while —azi/2 
keeps some large constant value such that Heaviside’s gener- 
alized exponential series is valid. Then each H(z) may be re- 
placed by the first term of the expansion 


1/2 2 
(sz) (=) et (z—-(2pt1) 4/4) (1 


1!82 


4p? — 1)(4p? — 3? 
218222 


and the principal terms in (8) are, after canceling the common 
factor (2/(mz) )'/2, 


: [ ( a 2m + 1 
7/4] 4. B (1 — r| 
2 4 


(— a2i/2)*—” 


ei(z—(2m+1) /4) 
(9) ae v) 


It follows that A =1 and B=0. 


If the H,?,,,,(z) in (8) are replaced by J,_,:m(z), then the 
right-hand side of (9) is replaced by 


ou ou 
21 4) +s — = 0. 
0a 
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| 2(n —v-+m)+1 | 
——— cos] 2 — 


cos 
n=—o — v) 4 
((—azi/2)"~” ei (2—(2m+1) 
(azi/2)"—” #/4) 
— v) 2 } 


1 1 
= — gilz(1—a/2)—(2m+1) + — 


| ( a 2m + 1 
= cos | z 
2 4 


and we must conclude that A =0 and B=1. 
The differential equation satisfied by 


y= 
m/2 (az/2)"~” a) 
II(n — v) 
is 
m? 
y’+- -«-")y 
(10) 


= a(1 — a)”/? 
N — 1 —»p) 


This differential equation has two particular integrals, 

ty and — )>-*"' and they differ by the complementary func- 
tion H‘)(z(1—a)"/?). The divergent particular integral and the 
complementary function are inextricably tangled up in the 
convergent particular integral. 

If we choose a sector of the complex plane where H‘)(z) 
vanishes as z goes to infinity we can see by inspection of the 
differential equation that as z goes to infinity the particular 
integral may behave in either one of two different ways. It 
may approach zero as does the right-hand side of the differential 
equation. If it does not approach zero as does the right-hand 


2 2 


1935-] BESSEL FUNCTION EXPANSIONS 705 


side, it must be approaching a Bessel function in such a way 
that it does not completely satisfy Bessel’s equation, but does 
so with a small and rapidly vanishing residue, the right-hand 
side of (10). The divergent particular integral must be the one 
that vanishes like the right-hand side. The convergent particu- 
lar integral must be the one that eventually behaves like a Bes- 
sel function. 

If H™ is replaced by J the reasoning must be altered to 
specify that the imaginary part of z remains finite while the 
real part of z goes to infinity. 

Formulas (4) and (6) are generalizations of formulas (5) and 
(8) on page 141 in Watson. The formulas in Watson are ob- 
tained by making v zero, or any integer. 

It should be noted that in verifying (4) and (6) it was not 
necessary to require that m be an integer. However, the re- 
strictions on Watson’s vy must apply to m also. 

If in (4) we put m=—1/2 and replace H by J, we get, 
since cos x= 


n= (az/2)"” 


— v) 


1/2 
(11) (=) cos — a)!/?) = I n—»-1/2(2). 


If in (6) we put m= —1/2 and replace H™ by J, we get, since 
sin x = 
n=0 (az/2)"-” 


2\1/2 
(12) (=) sin (2(1 + = 


n=—o Il(n — v) 


—(n—v—1/2) (z) e 


a 


If we put az/2 =#, then (11) and (12) are 


2 1/2 n=20 
(13) (<) cos (32 — 2zt)!/2 = J 


F\1/2 n—v 
(14) (=) sin (2? = — J _(n—»-1/2)(2). 

Equations (13) and (14) are generalizations of the Glaisher 
formulas (3) and (4) on page 140 in Watson. The Glaisher 
formulas are obtained by making vy zero, or any integer. 

If in (11) we put y=1/2, we get 


9 


4 2 1/2 x (az/2)"-! 2 
15) cos (s(1 — a)!/?) = J,_1(2) 
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or 
252 
a'!? cos (2(1 — a)"/2) = azJo(z) + (“ 1) Ji(z) 
+(= + ( a‘z4 
+( 140) + 
1-3-5-7-9 


It will be observed that the left-hand side of (16) is an even 
function of z while the right-hand side is an odd function of (z). 
This merely indicates that larg z| <a/2 and is not surprising. 

As a matter of curiosity (16) was computed with z=10 and 
a=1. The ascending part of the series, from Jo to Ji; inclusive, is 
1.0668. The descending part of the series, from — J; on, is 


—0.0435 —0.0255 —0.0018 +0.0033 +0.0025 -+0.0001 
—0.0023 —0.0043 —0.0059 —0.0072 —0.0081 —0.0087 


and so on. The sum of the first seven terms is —0.0671, which 
is a better check than one would expect from these figures 
The decision as to where to terminate the asymptotic series is 
complicated by the two changes in sign. 
If in (12) we put y=1/2, we get 


2 1/2 
17) (=) sin + = 


TZ 
9_9 
sin + a)"/2) = azJo(2z) (= 1) Ji(z) 
(18) 


1 a‘z! 1-3 
1-3-5 az 1-3-5-7 az? 


With z=10 and a=1, and so using the terms computed for 
(16), we find that the left-hand side of (18) is 1.0000, the ascend- 
ing part of the series is 0.9786, the asymptotic part is 0.0230, 
and the right side is 1.0016. 

Since both sides of (18) are odd functions of z, this formula 
is only restricted to large absolute values of z. 
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ON APPROXIMATION BY POLYNOMIALS TO A 
FUNCTION ANALYTIC IN A SIMPLY 
CONNECTED REGION* 


BY O. J. FARRELL 


In a previous paperf the writer studied expansions in series 
of polynomials of a function f(z) analytic in a limited simply 
connected region G where f(z) is known either to be bounded 
in G or such that the double integral over G of the pth power 
(p>0) of the modulus of f(s) exists.{ The present note contains 
an extension of each of the two theorems obtained in the earlier 
paper. The extended theorems now read as follows. 


THEOREM A. Let G be a limited simply connected region of the 
z plane. Then in order that corresponding to every function f(z) 
analytic and bounded in G there shall exist a sequence of poly- 
nomials { Pa(z) } which converge to f(z) in G as n—& and at 
the same time such that 


(1) lim [| p,(z) |, inG] S bound [| f(z)|, z inG], 


ns 
it is necessary and sufficient that the boundary of G be also the 
boundary of an infinite region. 


THEOREM B. In the 2 plane let G be a limited simply connected 
region whose boundary is also the boundary of an infinite region. 
Let f(z) be analytic in G and such that 


(2) ff isoles, (p > 0), 


exists. Then there exists a sequence of polynomials { p,(z)} such 
that 


* Presented to the Society, September 4, 1934. 

+ This Bulletin, vol. 40 (1934), pp. 908-914. 

t The writer is indebted to Professor J. L. Walsh for having suggested a 
study of these two problems and also to Professor Torsten Carleman for send- 
ing a reprint of his paper on approximation to analytic functions by linear ag- 
gregates of prescribed powers (Arkiv fér Matematik, Astronomi och Fysik, 
vol. 17 (1923), pp. 1-30). 


= 
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(3) lim ff — = 0. 


It will be noticed that in the first of these theorems we no 
longer say that there exist polynomials { p,(z)} which converge 
to f(z) continuously in G, but merely that there exist poly- 
nomials { p,(z)} which converge to f(z) in G. For if polynomials 
| Pn(z)} converge to f(z) in G so that (1) holds, these poly- 
nomials are uniformly bounded in G and thus form in G a normal 
family of analytic functions from which can be chosen a sub- 
sequence converging to f(z) continuously in G. Hence, whenever 
there exists a sequence | p,(z) | converging to f(z) in G so that (1) 
holds, there exists also a sequence which fulfills (1) and con- 
verges to f(z) continuously in G. It will be seen too that in the 
second theorem we no longer say that there exist polynomials 
| p,(z)} which converge to f(z) continuously in G and for which 
(3) holds, but merely that there exist polynomials { pa(z)} for 
which (3) holds. This is because we have since found in the 
literature a lemma by Walsh* giving assurance that if (3) holds, 
the polynomials { p,(z) } do converge to f(z) continuously in G, 
so that specific mention of the convergence may be omitted. 

The proof of Theorems A and B requires only a slight modifi- 
cation of the proof of the two corresponding theorems in the 
previous paper. This modification is brought about by observing 
that if Gis any limited simply connected region whose boundary 
also bounds an infinite region, then there exists a sequence of 
regions }G,}, each of which is a Jordan region lying interior to 
its predecessor and which are all such that the sequence {G,} 
converges to G as kernel.7 If we use such a sequence of regions 
|G,}, the proofs of Theorems 1 and 2 of the previous paper 
apply to Theorems A and B, respectively, of the present note. 
It is to be remarked, however, that uniform approximation to 
f,(z) or F,(z) in G by a polynomial with arbitrarily small error 
does not now follow directly from the theorem of Walsh that was 


* Transactions of this Society, vol. 33 (1931), pp. 370-388, Lemma on p. 
387. 

+ Compare Carathéodory, Mathematische Annalen, vol. 72 (1912), pp. 
107-144, Chapter 3; or Walsh, Transactions of this Society, vol. 32 (1930), 
pp. 335-390, proof of Theorem X. 
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used before, but does follow indirectly from it, since f,(z) or 
F,(z), being analytic in the closed Jordan region G,4;, can be 
uniformly approximated with arbitrarily small error in G,s1 by 
a polynomial, and hence can be so approximated in G. In- 
deed, Runge’s classical theorem on polynomial approximation 
could be applied here and for that matter could have been used 
in the previous paper. For the closed region G is interior to every 
region G, and hence the function f,(z) or F,(z) can be approxi- 
mated as closely as desired in G by a polynomial in z. 

The proof of Theorem B and the proof of the sufficiency of the 
condition of Theorem A are the same from this point on as for 
the corresponding theorems in the earlier paper. And the proof 
of the necessity of the condition of Theorem A is also contained 
there in §5 as “A Remark on Theorem 1.” 

The writer hopes in a forthcoming paper to be able to deter- 
mine the most general type of limited simply connected region 
to which Theorem B can be extended. That this theorem does not 
hold for an arbitrary finite simply connected region is shown by 
the following simple example. 

Let G be taken as the region bounded by the two circles 
| | =b, b>a, and by the line segment a<zSb. Let 
f(z) =1/z. Denote by Q the doubly connected region bounded 
by the two circles. If now there existed a positive number p 
together with a sequence of polynomials {,(z)} such that 


lim ff | p,(z) — 1/2|"dS = 0, 
n— 2 G 

it would follow that 

(4) lim ff | pa(z) — 1/2|"dS = 0. 
Q 


Consequently the polynomials { p»(z) } would converge* to 1/z 
in Q and the convergence would be uniform on any closed point 
set lying in Q, say on a circle | z| =c,a<c<b. Hence, the poly- 
nomials {p,(z)} would converge uniformly on and within the 
circle |z| =c to a limit function analytic within this circle. But 


* This convergence would follow by the lemma of Walsh to which reference 
was made above in the paragraph immediately following the statement of 
Theorem B. 
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such a function could not be equal to 1/z in the ring region be- 
tween the circles | z| =a and |s| =c. 

The main feature of this example is the use of the lemma of 
Walsh whereby we know that if in a finite region R we have a 
sequence of polynomials { p,(z)} for which 


lim (p>), 
R 

where f(z) isa given function analytic in R, then the polynomials 
{ p»(z) } converge to f(z) continuously in R. The converse is not 
always true, as was shown by an example in §4 of our previous 
paper. There is, however, a qualified form of converse which 
does hold for an arbitrary limited region and for an arbitrary 
function analytic therein. We may state this result as follows. 


THEOREM C. Let R be a limited region of the z plane, and let 
f(z) be analytic in R and such that 
lrores, (p > 0), 
R 


exists. If polynomials { p,(z) } exist which converge to f(z) continu- 
ously in R and for which 


lin f f | p»(z) |"dS = f f | f(z) |"as, 
R R 

tim ff | — pals)|as = 0. 

R 


The proof of this theorem is already contained in the latter 
part of the proof of Theorem 2 in our previous paper. 

We close this note with a result closely connected with 
Theorem A. 


then 


THEOREM D. Let G denote a limited simply connected region 
whose boundary does not bound an infinite region. Let B denote 
the boundary of the infinite region among the regions into which 
the closed region G separates the plane, and let T denote the region 
(also simply connected) consisting of all the points which can be 
joined to an arbitrary fixed point of G by a Jordan arc containing 
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no point of B. Let f(z) be analytic and bounded in G. A necessary 
and sufficient condition for the existence of polynomials { pn(z) 
which converge to f(z) in G so that (1) holds is that there exist a 
function analytic and bounded in T and equal to f(z) in G. 


The proof of this theorem is much the same as for Theorem 
A taken together with the remark of §5 in the earlier paper and 
is therefore omitted. 

The conclusion of Theorem D simply means of course that 
f(z) shall be analytically extensible throughout YT. 


Union COLLEGE 


A GENERALIZED PARSEVAL’S RELATION 
BY E. S. QUADE 


A function ¢(x) which is non-negative, convex, and satisfies 
the conditions = 0 and (¢(x)/x)—* as will be called 
a Young’s function. Given such a function $(x), a second func- 
tion, ¥(x), with the same properties can be found such that 
Young’s inequality, ab <¢(a) +y(b), holds for every a,b 20. The 
functions ¢(x) and ¥(x) are then said to be complementary in 
the sense of Young.f 

If x(£) is such that S| x| )dt exists, x(t) is said to belong to 
the space L,(a, b). This space is not necessarily linear.{ For 
this reason we denote by L;*(a, b) the class of all functions x(f), 
ast<b, such that the product x(f)y(t) is integrable for every 
y(t)eLy(a, b). If we put 


Il 


| b | 
sup | x(t) y(t)dt, 
y a | 


for all y(t) with 


Py 


b 
4, 


then L* is a linear metric, and complete space.§ A function 


+ W. H. Young, Proceedings Royal Society, (A), vol. 87 (1912), pp. 225- 
229. 

t W. Orlicz, Uber eine gewisse Klasse von Riumen vom Typus B, Bulletin, 
Académie Polonaise, (A), Cracovie (1932). 

§ A. Zygmund, Trigonometrical Series, 1935, pp. 95-100. 
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x(t)¢eL* can be approximated by its Féjer polynomials; that 
is, given e€>0O, there exists an No(e) such that 

|| x S €, n Noe), 
where go, =¢,(t; x) is the mth (C, 1) mean of the Fourier series 
of x(t). Since ¢,(t; x) is continuous, there exists a step function 
w,(t) which is a linear combination of simple step functions such 
that | o,(¢; x) — w,(t)| ¢fora<i<b. Then 


x — — + [lon — 


IIA 


b ] 
e+ sup| f [o,(t) — | 
y |i a 


Se+ «(sup f | | it) 
y a 


< €[o(1)(b — a) + 1] = Me, 


A 


where JM is independent of x(¢). This implies that the set of 
simple step functions is a fundamental set in the space L;*. 

In the following we take (6, 1) as the fundamental interval 
and {K,(s, t)} to be a sequence of measurable kernels defined 
on the unit square. 


THEOREM 1. Necessary and sufficient conditions that 


1 1 1 
f visas f K,,(s, t)x(t)dt x(s)y(s)ds 
0 0 0 


for every pair x(t)eL*, y(s)eL¥ are as follows. 
(1) If 6=(a, b) and +=(a, B) are any subintervals of (0, 1), 
then 


5 B 
f ds K,,(s, t)dt — meas (6-7); 
1] 1 


(2) f K,,(s, t)x(t)dt| < M||x\\,, 
0 v 


where M is independent of n.t 


+ A. Zygmund, loc. cit., p. 107 (14). 

t For analogous results in some more special cases we refer to J. C. Burhill, 
Strong and weak convergence of functions of general type, Proceedings London 
Mathematical Society, (2), vol. 28 (1928), pp. 493-500; Z. W. Birnbaum and 
W. Orlicz, Uber die Verallgemeinerung des Begriffes der zueinander konjugierten 
Potenzen, Studia Mathematica, vol. 3 (1931), pp. 1-67; W. Orlicz, loc. cit. 


— 
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Since the set of simple step functions is a fundamental set, 
the necessity of (1) is obvious. In order that the integral 


exist for arbitrary 
1 
U,(x) =f K,,(s, t)«(t)dt 
0 


must belong to L* for arbitrary x(t) ¢Lj* and n. We wish to show 
that U,(x) isa linear operation on to We need the follow- 
ing lemmas. 


Lemma A. lim, ..||x,|| y= 0 implies lim asymp, ...*,(£) = 0. 


Set y,(¢) =a sign x,(t), where a>0, <1. Then 


Il 


1 1 ] 
|| sup = f yn(t)x,(t)dt 
y 0 0 


1 
a f | x(t) | dt 0. 
0 


ll 


But 


1 
lim f | X,(L) | dt = 0 implies lim asymp «,(¢) = 0. 
0 


n— 2 


Lemma B. lim,..||x,{\y=0 implies the existence in L¢ of a 
subsequence \xn,(t)}{ and an x(t) such that | x, ,(2)| | for 
every1=1,2,3,---, and almost all t. 


Let ||x,(¢)||, 0. Without loss of generality we may take each 
x,(t)20. Then there exists a subsequence {x,,(t)} ¢ {x,(t)} 
such that }>*,||xn:||)<M@, where M is a constant. This means 
that Dated converges in the space L* to a function x(t) eL*. 
Then 


| xn,(é)| = S x(2). 


LemMA C. lim asymp,,..%n(t) = x(t) implies lim, 
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Suppose the contrary. Then there exists a subsequence 
{x,,(é)} such that This means that 


1 1 
at| < || which implies dt < 
| | 
0 0 


for all sufficiently large. Since lim asymp;...x,,(t) x(t), there 
exists a subsequence x,,(t)—x(t) almost everywhere. Hence 
| n,(t)y(t) | | x(t)y(2) | almost everywhere for each fixed y(¢). 
By the Fatou Lemma we have 


1 1 | 1 
lim. | | dt= f | xy | dt = xydt). 


Since this must hold for every y(t), we have 


1 
lim = sup lim | | de = 
y dy 


|, =||x||,, a contradiction. 
Consequently, by a general theorem of Banach,f U,(x) is a 
linear operation. Hence || U,,(x)||y < M,J|x||y. But the sequence 


pat f K,(s, ¢ 


is bounded (for each fixed x(t) «L,j*) for every y(s)eL*. This im- 
pliest|| U,,(x)||, < M(x) independently of n. Then, by the Banach- 
Steinhaus theorem, 


which means that lim 


1 
= Lf K,(s, s 


where M is independent of n. 
To prove the sufficiency we set 


B 1 
f K,(s, = f K,.(s, t)Was(t)dt = w,(s), 
a 0 


+ S. Banach, Théorie des Opérations Linéatres, 1932, p. 87. 
t A. Zygmund, loc. cit., p. 99. 
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where wag(t) is the characteristic function of the interval (a, 8). 


From (1) we have 
b b 
J W,(s)ds Wag(s)ds. 


1 1 
-{ visas f K,,(s, -f y(s)w,(s)ds. 


Then if A= M||weally and = max (1, since 


1 
Pwp/d =f (>| w,(t) | = 1, 


we havet 


Let 


| (9) | Ally|l S sll 


where the right-hand side of the inequality is independent of n. 
Hence from a general theorem on linear functionals,t we have 


1 1 
f y(s)Waa(s)ds 
0 0 
for arbitrary y(s)¢«L*. Now set 
1 1 
£n(x) -{ visas f K,(s, #)x(t)dt. 
0 0 


This gives a sequence of linear functionals defined on Lj‘. For 
every Was(S), 


1 
0 
Moreover, since§ 


|} 1 ll 
| of || Kats, || of 
ly 


7 A. Zygmund, loc. cit., p. 97. 
tS. Banach, loc. cit., p. 123, Theorem 2. 
§ A. Zygmund, loc. cit., p. 97. 
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we see that the modulus of the functional g,(x) is SM, in- 
dependently of x, and we may again apply the theorem on func- 
tionals used above to obtain the desired result. 

As a corollary of Theorem 1, we have the following theorem. 


THEOREM 2. Necessary and sufficient conditions that 


for every pair y(s)eL,, x(s)e Ly, 1< are: 


b 
(1) f as f K,(s, t)dt > meas (6-7); 


1 1 |p’ 1/p’ 
(2) (f | f K,,(s, t)x(é)dt| as) < M||x||,-. 
0 0 | 


|For the case x(s) €L, y(s) =L,, the theorem holds provided 
we replace in (1) the interval (a, b) by the arbitrary measurable 
set E. The proof for this case is essentially the same as that of 
Theorem 1. ] 


THEOREM 3. Necessary and sufficient conditionst that 


for every pair y(s)eL, x(t)eM are: 


b 
(1) f as K,,(s, meas 6- E; 
a E 


1 
(2) ess sup | K,(s, t) | M. 
5 0 


Except for the necessity of (2) the proof is exactly that of 
Theorem 1. As in Theorem 1 we have 


| 
ess sup f K,(s, t)x(t)dt |= 
0 


7 J. D. Tamarkin, Zentralblatt, vol. 10 (1935), pp. 349-350, in review of 
paper by I. Natanson, Bulletin, Société de Physique et Mathématique, Kazan, 
vol. 3 (1934). 


1935-] GENERALIZED PARSEVAL’S RELATION 717 


Letting x(t) =e(t) be the characteristic function for the set E, 
we have 


| 1 
ess sup) f K,(s, 
s 0 


This impliest that (2) holds. 
If, in Theorem 2, condition (2) were replaced by either 


(2’) Lit |K,(s, s M, 


or 


1 1 plp’ 1/p 
a)" 
0 0 


= 1,2,---), 


the conditions of Theorem 2 would be sufficient but no longer ne- 
cessary. For example, the Dirichlet kernel, D,(s, t)=D,(t, —s), 
(where the fundamental interval is (0, 27) and the functions 
are assumed periodic) does not satisfy either of these conditions 
but does satisfy those of Theorem 2. We note that for p=2, 
(2’’) implies (2’) and for 1<p2, (2’) implies (2’’). 


THEOREM 4. Sufficient conditions that 


1 1 1 
f visas f K,(s, t)x(t)dt > y(s)x(s)ds, 
0 0 “0 


where x(s) eLj*, y(s) are: 


b B 
(1) f as K,,(s, t)dt meas (6-7); 
1 
(2) ess sup | K,(s, dt < M, (n = 1,2,---); 
s 0 
1 
(3) ess cup | Kn(s, 2) | ds=M, (n = 1,2,---) 
t 0 


{7 Saks and Tamarkin, Annals of Mathematics, vol. 34 (1933), p. 600. 
Theorem 2. 


(n =1,2,---). 
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We set u,(s) =f\K.(s, t)x(t)dt and consider 


un(S) K,(s, t) 


1 
ess cup | K,(s,#) | dt<1 
0 


Since 


and (0) =0, we have by Jensen’s inequality 


| | K,(s, t) x(t) 
K,(s, #)| 
| Welle i) 


t 
sof K,(s, t) | dt. 
M 
Consequently 


1 | t 

1 | t 

al | K,(s, #)| ds 
| 

llelly 

We have thus shown that 1,(s) = s, t)x(t)dt takes x(t) 


into u,(s)¢eL/# for arbitrary m, and hence condition (2) of Theo- 
rem 1 follows as in the proof of the necessity in that theorem. 


lA 


IIA 


Brown UNIVERSITY 


x(t)dt. 
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ON THE COMPOSITION OF QUADRATIC FORMS* 
BY E. D. JENKINS 


1. Introduction. A compound of two binary quadratic forms 
filxi, 1) and fo(x2, ye) is a third form f(x, y) such that f(x, y) 
=fi(x1, yi)f.(x., y.) under a primitive bilinear transformation 


= + poxiye + psyirte + payiye, 
Y = + + + 


The fundamental problem of this theory is to determine a com- 
pound of two given forms and the transformation under which 
the relationship exists. 

This problem has been considered{ by Gauss, Arndt, Dede- 
kind and others. The method of Dedekindt was based upon a 
correspondence between forms and moduls in an algebraic field, 
composition of forms corresponding to multiplication of moduls. 

The method of the present paper is also based upon a cor- 
respondence between forms and moduls. These moduls form a 
subclass of those considered by Dedekind in that only integral 
moduls are employed. These moduls in turn are in correspond- 
ence with matrices with rational integral elements, the greatest 
common divisor process corresponding to multiplication of 
moduls and hence to composition of forms. By this corre- 
spondence and the matric g. c. d. process of Chatelet,§ the com- 
position of quadratic forms requires but a small fraction of the 
time and labor required by previously known methods. 


2. The Correspondence. Let 1, 6 be an integral basis for the 
quadratic field §(./m). Then 6? =m if m=2, 3 (mod 4), (Case 1); 
and 6?=0+m’,4m’ =m—1ifm=1 (mod 4), (Case 2). Let a,b,and 
k be rational integers, of which a and & are positive, such that 
b?— k?=0 (mod a), (Case 1); b?+-bk —k?m’ =0 (mod a), (Case 2). 


* Presented to the Society, April 19, 1935. 

+ Gauss, Disquisitiones Arithmeticae, 1801, §234 ff.; Arndt, Auflésung einer 
Aufgabe in der Composition der quadratischen Formen, Journal fiir Mathematik, 
vol. 56 (1859), pp. 64-71. 

t Dirichlet, Vorlesungen iiber Zahlentheorie, 4th ed., 1894, pp. 640-657. 

§ A. Chatelet, Groupes Abéliens Fints, 1924, p. 26. 
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The general number a = a;x+ae2y of the modul with basis 
(1) a,=a, as = b+ ké 


has the norm 


b? — k?m 


a 


N(a) 


b? + bk — k?m’ 
a( as" (2b k)xy - 
in Cases 1 and 2, respectively. We shall say that the form 


b? — k*m 

f(z, 9) = ax + + 
+ bk — k*m’ 
I(x, y) = ax? + (2b + k)xy + —— 


a 


is associated with the basis (1) and also with the matrix 


a 0 
(3) G= ( ). 
b k 


The fact that we consider only irreducible forms makes it neces- 
sary that a~0 and 
The totality of bases of the above modul is given by 


(4) Bi = C1101 + Bo = C2101 + 


where a1, @2 is the basis (1) and the c;; are rational integers of 
determinant +1. Then all bases of the modul have matrices 
of the srm CG, where C=(c;;) is a unimodular matrix with ra- 
tional integral elements. 

Let 8=81x1+ey1. Define the transformation 


(5) X = + V = 1241 + Co2V2 

of determinant +1. Then B a,X +a2/Y, and in Cases 1 and 2, 
respectively, 

—- = aX? + 20XY + —— F?, 


a a 


filmi, yi) = 


b? + bk — _ 
aX? + (2b + k)XY + —————__ F?,, 


a 


| 
| 

| 
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is a quadratic form which, by (5), is either properly or improp- 
erly equivalent to f(x, y). Since every basis of the modul can 
be reduced by a transformation of determinant +1 either to the 
basis (a, b+ 8) or to the basis (—a, b+), the form represented 
by any basis of the modul is properly equivalent either to the 
form (2) or to its opposite, 
b? — k?m 
g(x, y) = ax? — 2bxy + jy’, 
a 


b? + bk — k?m’ 
ax? — (2b + k)xy + ——————__ ’, 
a 


in Cases 1 and 2, respectively. However, the form g(x, y) can be 
represented by the matrix 


a 0 a 0 
G’ = ( ). (Case1), G’= ( ), (Case 2), 
—bB k —(b+k) k 


with a and k positive, so we need employ only matrices of type 
(3), or matrices which can be reduced to type (3) by a trans- 
formation of determinant +1. 

Next, consider an irreducible form 


(6) fla, y) = ax? + Bry + cy’, (a > 0). 


The problem is to determine what integral algebraic modul will 
define the above form. 

Consider the discriminant d=B?—4ac=[?m, where m con- 
tains no square factor greater than 1. Consequently m=1,2,or 3 
(mod 4). If m=1 (mod 4), B?=P (mod 4) implies B—/ is even. 
Hence we define integers b and k such that k=/, 26+k=B. If 
m=2, 3 (mod 4), B?=l?m (mod 4) can hold only if B and / are 
both even. Hence we define 2k =/, 2b=B. In either case 


dé, a. = b+ 


is a modul of type (1), whose existence we supposed, and this 
modul yields form (6). 

We wish to determine a necessary and sufficient condition 
that a matrix of positive determinant and having k as the posi- 
tive g. c. d. of the elements of the second column, correspond 
to a quadratic form. 

Let I be the matrix* 

* C. C. MacDuffee, Annals of Mathematics, (2), vol. 29 (1928), p. 200. 


| 
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re (0 0 
1 


for Cases 1 and 2. Consider first a matrix of type (3). In Case 1, 


0 = ea+ fb, ak = fk, 
k*m = ga + hb, bk = hk. 


Then 


Since k ~0, we see that f=a, h= b, e = —b, g= — (b?—k?m)/aare 
rational integers if G corresponds to a form. Conversely if 


is a matrix with rational integral elements, b?— k?m=0 (mod a) 
and G is a matrix of a form. The proof for Case 2 is similar. 

If G’ is any non-singular matrix of positive determinant, 
then* G’=CG, where C is an integral matrix of determinant 
+1, and G is a matrix of type (3). If G’ is a matrix of a form, 
then G=G'G’ is also a matrix of a form, whence, by the above 
proof, 


(7) GkT = AG, 

where A’ is a matrix with rational integral elements. Hence 
(8) = CA'C'G’ 

where CAC! is also a matrix with integral elements. Since (8) 
also implies (7), we have proved the following theorem. 


THEOREM 1. A necessary and sufficient condition that a non- 
singular matrix G’, of positive determinant, the g. c. d. of the ele- 
ments of whose second column is k, represent a quadratic form of 
discriminant 4k?m (Case 1), k?m (Case 2), is that 


= AG’, 


where A is a matrix with rational integral elements. 


* C.C. MacDuffee, Theory of Matrices, 1933, p. 32. 


= 
let 
| 
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3. Product of Moduls. We define the second or S-matrix of 
any algebraic number B=b,+520 to be the matrix 
S(6) = 57, 


where I is the identity matrix and T is defined as above. The 
theorem which was proved* for ideal multiplication in a linear 
algebra holds for the multiplication of integral algebraic moduls. 
That is, if G; and Ge are the basis matrices of the moduls 
M,= a2| and Mz= Bz], a matrix of the product modul 
M is a greatest common right divisor of G,S(6;) and G,S(B2). 
We shall prove the following theorem. 


THEOREM 2. A matric g.c.r.d. (of positive determinant) of 
G,S(B1) and G;S(B2) represents a quadratic form. 


Let the two given matrices, representing forms f;(x1, yi) and 


f2(x2, be 
ay, 0 a2 0 
G; = ( ), G2 = ( ). 
bi ky be ke 


The matric g. c. r. d. of G,S(81) and G,S(82) may be put into 


the form 
a 0 
o= (5 
b k 


where a and & are positive integers. Moreovert 


(0) GS (61) L(G 
0 
Case 1. 
0 ayke 
6 = + kikom bike + 


By (9), comparing elements in the first row and second column 
on both sides of the matric equation, we see that 0=qik. Since 
k+0, q:=0, and by comparing elements in the first row and 
first column, we see that there exists an integer p; such that 


* Grace Shover and C. C. MacDuffee, this Bulletin, vol. 37 (1931), pp. 
434-38. 
¢ A. Chatelet, Groupes Abéliens Finis, 1924, p. 26. 
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(10) = pia. 


Also from (9) it is seen that there exist rational integers p, q, 
r, s such that 


b = payde + ghide + raibe + s(bibe + Rikem), 
k = + + s(bike + kibe). 
— = (b? — + ra? (b? — 
+ s*(b? — k?m)(b? — k?m) 
+ — k2?m) + 2rsayb,(b? — k?m) 
= aycy + ra? doce + + 
+ 
= 0 (mod 442) = 0 (mod a). 
Similarly, in Case 2, b?-+bk —k?m’=0 (mod a). 
Since the g. c. r. d., taken in the form above, represents a 
quadratic form, any g. c. r. d. of positive determinant will also 


represent a form. 
Let the general numbers of the moduls represented by mat- 


rices G;, Ge, and G be 
= a,x, + (bi + B = d2xX2 + (be + kof) ye, 
y =ax+ (b+ kO)y. 


There exists a primitive bilinear transformation under which 
aB =v. Then, since N(a) N(8) = N (7), it follows that 


V1) fo(%2, Ye) = ag(x, y); 


where ¢ is the form represented by matrix G. By (10), we have 
the following theorem. 


THEOREM 3. The form (x, y), represented by a matric g.c.r. d. 
of GyS(B1) and G,S(B2), is such that 


Pifi(xi, ¥1)fo(x2, v2) = O(x, 
where p, is the integer defined by (10). 


Dedekind has established* a correspondence between binary 


* Dirichlet, Zahlentheorie, 4th ed., §187. 
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quadratic forms and algebraic moduls. The moduls we consider 
form a subset of those considered by Dedekind, but a subset 
which is very convenient from the computational standpoint. 

Let the matrix G, given by (3), correspond to the form f(x, y). 
By the Dedekind theory, the algebraic number w = (b/a) + (k/a)0 
satisfies the irreducible equation 


a 


1 b? — k*m 

— 2bw + = 0, (Case 1), 
1 b? ++ bk — k?m’ 
— (2b + kw + =0, (Case 2), 
where / is the g. c. d. of a, 2b, (b?>—k?m)/a, or of a, 2b+-k, 
(b?+bk —k?m’)/a for the respective cases. If f’(x, y) denotes the 
form corresponding to the matrix G under the Dedekind theory, 
f(x, y) =If'(x, y), where / is the g. c. d. of the coefficients of 
f(x, y). It is also clear* that for the modul M, corresponding to 
G, the norm N(M) =a?~+(a/l) =al. Since 


N(M,)N(M:) = N(M), 


where modul MM is the product of moduls M, and Mz, it follows 
that a,d/,l,=al. Since, by (10), a1a2=f,a, the above equation 
becomes 


(11) Pilile = l or Lyle = 


If fi , f¢,and f’ are the forms corresponding to moduls M,, Mo, 
and M under the Dedekind theory, then f/f? = f’, where f’ 
is a compound of ff and f/. Multiplying both sides of this 
equation by (112), we have fi(%1, y1)f2(%2, ye) = f(x, y), where 
f is acompound of f; and fe. But f;: and f2 correspond to matrices 
G, and G2, while the form ¢(x, y) =/f’ =pif(x, y) corresponds 
to G. Hence we have the following theorem. 


THEOREM 4. The form (x, y) which corresponds toa g.c.r. d. 
of G,S(B1) and G,S(B2) is such that $(x, y) = pif(x, y), where 
f(x, y) is a compound of fi(x1, y1) and fo(x2, y2), and ts 
an integer. 


4. Conclusion. A computational method for the composition 
of forms is now at our disposal. Furthermore, if we restrict our 


* Ibid. 


— 
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matrices to reduced forms, the element in the upper right corner 
being a 0, the form may be read directly from the matrix with- 
out reference to the modul it defines. For other forms of the 
matrix we can find the corresponding binary form from the 
norm of the general number defined by the matrix. In either 
case we have a simple and direct method for determining a com- 
pound of two given forms. 

As an example, let us take f1(x1, y1) =5x? +2x1y1+5y?f, and 
fo(xe, v2) = 77x? +20xeye+2y2. Their discriminants are d; = —96 
= 16(—6) and d.= —216=36(—6), so the forms are associated 
with the field §(./ —6). Since —6=2 (mod 4), 


01 
Ke ( ). 
—6 0 
We note that 16(—6) =4k?m=—24k? and 36(—6) = —24k?, 
whence’ ki=2, ko=3. Then 


( 385 00 0) (385 0 0 0) 
| 77 154 0 0] y 183 1 0 0 
| @ 000 0}’ 
|\—26 23 0 0 | 000 0 


so 
we 
G= 
183° 4 
and, since $;=1, the compound is the form 
f = 385x? + 366xy + 87y?. 


By equating the coefficients of 1 and @ in the equation aB=y, 
the bilinear transformation is found to be 


X= — — 73 x2y1 — 
y= + 154x291 + 23 


Tue Onto State UNIVERSITY 


5 0 770 
1 2 10 3 
385 °) 50 15 
154)’ S@)=(_ 93): 


1935-] CERTAIN CONTINUED FRACTIONS 72 


=! 


ON CONTINUED FRACTIONS OF THE FORM 
1+ K (b,2/1) 


BY H. S. WALL 


1. Introduction. The principal object of this paper is to deter- 
mine the region of convergence of the infinite continued fraction 


when };, be, b3, - - - are real or complex numbers such that for 

some k2=1 

(2) lim barzm = Om, (m=1,2,3,---,k). 


The results may be stated in terms of the numerators and 
denominators tn,,, n,, of the mth convergent of the continued 
fraction 1+ (Gnkim=Om), as follows. 


THEOREM 1. Let? us write and 
Ay, and let us set 
Z(z) = — (— 2)*o102--- 


Let R be an arbitrary bounded closed and connected region of the 
z plane containing the origin on the interior and which contains 
(within or upon the boundary) none of the zeros of the polynomials 
G,, H;., nor points z such that Z(z) is a real number S —1/4. Then 
(1) converges over R except at certain isolated points pi, po, --- , 
Py, and uniformly over the region obtained from R by removing the 
interiors of small circles with centers pi, po, - + - , Pu. The limit is 
a non-rational function of z analytic over R except at pi, po, - - - 
Pu, which are poles. 


The function Z(z) determines a transformation of the z plane 
into the Z plane and Z=Z(z). Except in the case oy02- - - ox 
=0, the set of points in the z plane such that Z is real and 


We write un,o= tn, and Un,0=%n. 


= 
= 
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< —1/4 is a portion of a curve C; to which corresponds under 
the transformation the real Z segment (— ©, —1/4). The curve 
C; is a stelloidt or Holzmiiller{ hyperbola in the plane of 1/z. 
In particular, C, is a straight line, and C, a circle in the plane of z. 

In §2 we prove Theorem 1; §3 contains a discussion of the 
curves C;; and §4 contains examples and a discussion of the 
power series which corresponds to (1). 


2. Proof of Theorem 1. If N,(z)/D,(z) is the nth convergent 
of (1), there are k continued fractions 


Kn= + K Z?/1), (m = 1,2,---, &), 


with convergents V=0, 1, 2, 3, ---), and § 


Zo = ? 

Dn-1,0 

m (— - - - 
(3) Zi = ? 
Dm—1,0D 
n = 


Dox-1, (n—3)k+-mDox-1 »(n—2)k+m 


for n=>3, and where and is the nth 
convergent of 


By a known theorem, || if 


(5) lim Z, = Z(z) 


n=2 


(4) 


uniformly over a region R’, where Z(z) is a continuous function 
having nowhere in R’ a real value < —1/4, then there exists an 
index N such that if = N, the continued fraction K,~,(Z7",,/1) 
converges uniformly over R’ to an analytic limit F,(z). Then if 


+ Fouret, Comptes Rendus, vol. 106 (1888). 

t Holzmiiller, Einfiihrung in die Theorie der isogonalen Verwandschaften, 
1882, p. 170 and p. 203. 

§ Perron, Die Lehre von den Kettenbruchen, 1913, p. 200. 

| Ibid., p. 285. 
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N, /D, is the nth convergent of K», the latter will converge 
over R’ to the limit 

Ni + F,Ni-1 
(6) 
Di + 
provided the denominator in (6) is not =0. But if, as we now 
suppose, R’ contains the origin on the interior, this is impossible 
because the denominator = 1 when z=0. Hence K,, converges to 
a function which is analytic except for poles, and clearly con- 
verges uniformly in the region obtained from R’ by removing the 
interiors of small circles having these poles as centers. Also, Km 
converges uniformly in the vicinity of the origin. From this it 
follows{ that if R’ is connected, and if (2) holds for m=1, 2, 
3,---, % uniformly over R’, and Z(z) is independent of m, 
then K,=K.,= -- - =K,=P(z), where P(z) is the power series 
corresponding to (1), and hence (1) converges after the manner 
of K,, to the same value. 

It remains to be shown that under the hypothesis (2), (5) 
holds over the region R described in the theorem, and that 
Z(z) = —(—2)*o102 - - - We have, if 6,=nk-+m, 

(7) = + — 
and hence, if ), 
m —(- 2) + 


Zz = 
A,-3 


where 


By (2); €,=0, Nx.,3,= Uk.m; 
=Vz,m uniformly over R. Also 


Vi,m + Uk-1,m — = VE + — Ve-1 = Hi, 


for all m=>1. It now follows that over the region R 


lm Z, = — (— 2)*o102--- o:/H,?, (m = 1, 2,.- += 
n= 2 


uniformly, as was to be proved. 


{ Ibid., p. 342. The argument used there applies with slight modification to 
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In case o\02--- o,=0, it is clear that Z(z) =0, and there- 
fore can never be real and < —1/4. In this case (5) holds uni- 
formly over every bounded region from which the neighbor- 
hoods of the zeros of G;, H; have been excluded. It may happen 
that these neighborhoods need not be excluded. More generally, 
even if (2) fails to hold, we have the following theorem. 


THEOREM 2. If for some integer k=1 the functions Zy’, (m=1, 
2,---, k), defined in (3) converge uniformly to 0 forn=© over 
every bounded region, then the continued fraction (1) represents a 
meromorphic function of z and converges except at the poles of that 
function. 


When &=1 this reduces to the conditionf lim 5, =0 found by 
E. B. Van Vleck. In §4 we shall give examples illustrating this 
theorem in the cases k=3, 4. 


3. Discussion of the Curves Cy. Put p=(—1)*oi02--- ox, 
and let +0. By Ci we shall understand the set of points in the 
z plane which is the image of the real Z segment (—1/4, — ©) 
under the transformation 


Z = — 


Then C/ is a portion of a curve C; which is the image of the 
negative half of the real Z axis under this transformation, and 
is a cut for the function represented by (1). 

If (2) holds, then 
(8) lim = om (m = 1,2,3,---, 

n= 

where g=2k and On4.4=Om =Om. If we had started with the 
hypothesis (8), then instead of the function Z(z) we would have 
Z'(s) = 21/H? ; and 


(9) G, =G2Hs, 

(10) H, = H2 — 2p*, 
1 

(11) Z'(z) = 


(2+ 


In fact, if we let n= in (7), we obtain the relation v,_1,m 
=v:_1,mll;, from which (9) follows at once. From three rela- 
tions analogous to (7) we obtain the identities 


{ Ibid., p. 345. 


= 
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= VE + — VE), = + 
= + — Ve-1), 


and consequently H,= = — 2pz*, 
which is (10). Finally, on eliminating —pz* between the rela- 
tions 


2 
Z=-—ps'/H2, Z'=-- |- | 
(H2 — 2pz*) 


we obtain (11). 

It follows from (11) that C/ is the same as C/ ; and from (9) 
we see that the zeros of G, are those of G,/7,. When +0, the 
zeros of /7;, and of H, lie on the cut; and when p=0, the zeros of 
H, are the same as those of H;, by (10). Hence when & is odd in 
(2) we may turn to (8) instead and obtain precisely the same 
region of convergence of the continued fraction (1). There is no 
loss in generality in assuming k even in (2). 

In order to identify the curves C;, and determine C/ it will 
be convenient to replace z by 1/z’ and study the corresponding 
curve E; in the plane of z’=1/z, and the portion Ef? of E, 
which corresponds to C/. If, as we now suppose, & is even, say 
= 2q, then H;, is a polynomial of degree g of the form 1+ )>{A,2”. 
We find that 


(— 


As Z(z) ranges through real values £0, 1/(Z(z))!/? ranges through 
pure imaginary values, 2’ over E;, and z over C;. As Z(z) ranges 
through real values from —1i/4 to 1/(Z(z))"/? ranges 
through pure imaginary values from — 2i to + 2%, zs’ over Ez, 
and z over C/. Set 


pi? = — A, = a,e'%, 2’ = re*®, 


where ¢ is any one of the possible arguments of p'/*; G, a, are 
real and positive, and a,, 0, r, X, Y are real. We have 


to 
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q 
X = cos (a, + q— V8), 


v=0 
Y = a,rt” sin (a, + q — v9), 
v=0 
(a9=1, ao=0). Let ¢ be real. Then £; is given parametrically 
by the equations 
X =tcos¢, Y =¢sin@. 
On eliminating ¢ we find that EF; is the stelloid or Holzmiiller 
hyperbola 
(12) X sing — Y cos¢ = 0, 
and that E/ is that part of EZ; for which 
(13) X? Y? <G?. 


If g=1 (k=2), E is a straight line, and (13) is the interior 
of a circle with center on FE. The curve CY is an arc of a circle, 
and Ej is a rectangular hyperbola. In case g=2, Cy and C{ can 
be determined by the following special method. First determine 
61, 62 by the conditions 


Ay =o, t+ 64 = + 4s), 


Az = + 204 = 67 + 57, 


(14) 


Ih 


so that 
A, + (— 1)"(8A2 — A?)"/? 


(v=1, 2), 8 6:62 =A? — 


If 5,6.=0, then A? —4A.=0. Hence if either A; or Az is zero, 
the other is also. If 6:62.40, put 


01020304 


(5152)? 


261 + + (6? + 


Then Z=AZ,. The function Z, is of the form of Z’ in (11) (for 
the case k=4), so that 


Z, = — 1/(2 + 1/22)’, 


= 
A= 
(6352)24 
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where A= When 6:6. =A,=A.=0, it is 
easy to see that Z = —o 102030424; and when 6:6.=0, A142+~0, 
Z = —060203042'/Ao?(2+2/A;)*. We find that there are four cases. 

CAsE 1. 6:6. =A,=A2=0. The curve C/ consists of four rays 
running from 2, to © in the direction from 0 to 2,, where Zn, 
(n=1, 2,3,4), are the four fourth roots of 1/(40102030;). In this 
case 0,0; or else 203, so that the case k=2 is not included. 

CAsE 2. 6:6.=0, A142+~0. In the plane of z/(z+2/A;) the 
cut consists of four rays as in Case 1 except that the four 
fourth roots of A? /(40102030,) are the initial points of the rays. 
In the plane of z the cut consists of arcs of two circles. Here 
01403 OF 

CASE 3. 6:620, 61: +62.=A,1/2=0. In the plane of 2? the cut 
is an arc of a circle. We may have o1=03, o2=0; if and only if 
01= —02. Thus (2) may hold with k=2. The cut consists of two 
rays running to © in this degenerate case. 

CAsE 4. 5:+6240. We may set 6,+62=26, and apply 
(11) (for the case k=2) to the function Z.62/(6,6.). We find 
that in the plane of (2+1/(6z))? the cut is an arc of a circle. If 
01 =03, 02 =04, 015% (2) holds with k=2, and the cut is an 
arc of a circle in the plane of z. If ¢,=02=03=04, (2) holds with 
k=1 and the cut is the ray running from —1/(401) to © in the 
direction from 0 to —1/(40;). 

If the o, are real and positive it is easy to show that 6,6.+0, 
6,+6.~0, so that Case 4 obtains. In this important case one 
may show that the cut is a portion of the negative half of the 
real z axis. The polynomial G, is 


(1 + + o22)(1 + + o32)(1 + 032 + o4z)(1 + + 


4. Examples and Applications. The following examples have 
been selected for the purpose of bringing out interesting points 
which might otherwise be overlooked. 

EXAMPLE 1. Let b3n-2=¢n0, lim c,=0, —b3,=a40. 
Here we have, with k=3, 


Ze 
(1 Cn—12) (1 + Cn2) 
Z2 = + 4+ 
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2°a7c, 


+ 


= 


Since lim,-.Zn7 =0, (m=1, 2, 3), uniformly over every bounded 
region, the continued fraction represents a meromorphic func- 
tion of z by Theorem 2. 

EXAMPLE 2. Let = limc,=0, Dan = —Dan_1 
=a>0. If b;=1/ai, bn =1/(dn_14n), (n=2), then den41>0. Since 
a2n41 diverges, it follows from the work of Hamburgert that 
the continued fraction converges except upon the real axis. To 
prove that it represents a meromorphic functionf{ it is but 
necessary to note§ that when k=4 


Ze = 


(1 — ac,_22”)(1 — 


so that lim Z,} =0 uniformly over every bounded region, and 
therefore K, represents a meromorphic function. 

EXAMPLE 3. If lim sup| b,,| <g, we may show 
that (1) represents a function which is analytic except for 
poles in the region|| |z| <1/(2g). In fact, if k=2, lim,-.Z% 
=0, (m=1, 2), uniformly over this region inasmuch as the 
polynomials D3,2nim=1+(benymi2t+ben+m43)2 have no zeros in 
this region if ” is sufficiently large; and lim,_., by bs =0. 

EXAMPLE 4. According to Theorem 1, the real segment (1, ©) 
is to be excluded from the region of convergence of the contin- 
ued fraction (1) if the b’s have the values };=1/2, b. = —1/2, 
bon = —1/(2(1+[n+1]?/n?)). 
One may show that this continued fraction converges or di- 
verges at z=1 (a point on this segment) according as p is >1 
or <1. In fact, when z=1, the mth convergent is 


+t Mathematische Annalen, vol. 82, pp. 120-187. 

t See this Bulletin, vol. 39 (1933), pp. 946-952, in which another example 
is given to show that (1) may represent a meromorphic function when the b, 
are real and benbeny1>0, lim sup|b,| >0. In that example convergence was 
established except at the poles of the function, whereas here the question of 
convergence at points on the real axis is not considered. 

§ The other Z,”, (m=2, 3, 4), are not all so simple in character. 

| If the condition lim ),b,,:=0 is dropped, the same holds in the region 
| z| <1/(4g) (see Perron, loc. cit., p. 343). 
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i 1 
when 1 is even and 
1 1 J 1 
[(n — 1)/2]? 2[(n — 1)/2]” 


when 2 is odd. 

As is well known, (1) has a unique corresponding power series 
P(z) =>°c,2”, (co=1), from which (1) may be obtained by a 
repeated division process. If a given power series P(z) has a 
corresponding continued fraction of the form (1), it is said to be 
semi-normal.t{ When convergent, the continued fraction 
furnishes a method for summing the power series. Let P’(z) 
be semi-normal with corresponding continued 
fraction c,+K(b/2/1). Then if (1) converges to f(z) when (1) 
corresponds to P(z), it is important to know conditions under 
which c,+ K(b/ 2/1) converges to the value (f(z) —1)/z. 

It is well known that if (1) converges to f(z), then when 
a.+K(b/z/1) converges it must have the value (f(z) —1)/z. 
This follows from the fact that the even convergents of 
1+c2+2K(b/</1) are the same as the odd convergents of (1).t 

The numbers b, and b/ are related as follows.§ Set a,=1/01, 
ay =1/b{, an=1/(Ondn_1), =1/(bJad_1), (n>1). Then if 
hn=Qitast 


aden = Gen+1 = dony2h,? 


If (2) holds with k even (say = 2gq), then it is not difficult to show 
that when the dD, are real and 


(15) lim = > 0, (m = 3; k), 


we must also have 


1, 2,3,---,&), 


(16) lim = > 0, (m 


t Perron, loc. cit., p. 304. 
t Perron, loc. cit., p. 447. 
§ Transactions of this Society, vol. 31 (1929), pp. 102-103. 


n= 
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(17) lim Dirim = Om, (m = 1, 2,3,---,k). 

Hence if Theorem 1 is applicable to the continued fraction cor- 
responding to P(z) it is also applicable to the continued fraction 
corresponding to P’(z) provided P(z) has real coefficients and 
(15) holds. From (16), (17) it then follows at once that Theorem 
1 can be in turn applied to the continued fraction corresponding 
to P’’(z) 2428", provided the latter is semi-normal, etc. 
It is easy to conclude from the fact that two successive con- 
tinued fractions obtained in this way have an infinite number of 
convergents in common that the function Z(z) of Theorem 1 is 
the same for all these continued fractions. We shall summarize 
the result in the following theorem. 


THEOREM 3. If --- ts a semi-normal 
power series with real coefficients and corresponding continued 
fraction (1) such that for k=2q equations (2) and (15) hold, and 
if L=c,+K(b/2/1) is the corresponding continued fraction for 
P'(z) =cit+c2+032?+ - - - (supposed semi-normal), then (1) and 
1+2L converge to one and the same function f(z) over the region 
R described in Theorem 1. If P™(z) is semi- 
normal with corresponding continued fraction c,+K(b™z/1), 
(n=1, 2, 3,---, 1), then all the continued fractions 1+ 
+--+ +e,2"+2"K (62/1), (n=1, 2, 3,---, 1), converge 
over R to f(z). 


The continued fractions are precisely the continued fractions 
of “type 1” of a Padé table,t whose convergents are “stair- 
like” files of approximants beginning upon the horizontal side 
of the table. One can show that when E(z), the reciprocal of 
P(z), and the series E™(z) obtained by removing from E(z) 
the first 2 terms and the factor 2", (n=1, 2, 3, - - - ), are semi- 
normal, then under the hypothesis of Theorem 3 the continued 
fractions corresponding to stairlike files of approximants be- 
ginning on the vertical side of the table also converge over R 
to f(z). 


NORTHWESTERN UNIVERSITY 


{ Perron, loc. cit., pp. 447-448. 


and 
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ON THE MATRIC EQUATIONS 
P(X)=A AND P(A, X)=0 


BY E. T. BROWNE 
1. Introduction. The matric equation 
(1) P(X) =A, 


where P(A) is a polynomial with scalar coefficients and A is a 
given square matrix of order n, has received a good deal of at- 
tention within the past few years. The problem is to find square 
matrices X satisfying (1). This equation may possess solutions 
X which are expressible as polynomials in A. On the other 
hand, there may exist solutions, but none expressible as a poly- 
nomial in A; and finally, there are equations of the type (1) 
which possess no solution at all.* 

In 1928 Rotht found necessary and sufficient conditions that 
there may exist solutions of (1) expressible as polynomials in A, 
and he found the number of such solutions, in case any exists. 
He employed the theory of elementary divisors. In this paper 
Roth gave a bibliography which was quite complete up to that 
time. In 1931 Franklin attacked the problem through the 
canonical form, and found not only all solutions X that are 
expressible as polynomials in A, but also solutions that are not 
so expressible. Rutherford{ also employed the canonical form. 
Still more recently Ingraham§ discussed the problem using the 
theory of elementary divisors. 

Let F;(A), (j=0,---,m) be known polynomials in A with 
scalar coefficients, and consider the more general equation 


* Franklin, Algebraic matric equations, Journal of Mathematics and Physics, 
Massachusetts Institute of Technology, vol. 10 (1932), pp. 289-314. 

7 Roth, A solution of the matric equation P(X)=A, Transactions of this 
Society, vol. 30 (1928), pp. 579-596. 

t Rutherford, On the canonical form of a rational integral function of a 
matrix, Proceedings Edinburgh Mathematical Society, (2), vol. 3 (1932), pp. 
135-143. 

§ Ingraham, On the rational solutions of the matrix equation P(X)=A, 
Journal of Mathematics and Physics, vol. 13 (1934), pp. 46-50. 
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(2) = 0, 
i=0 

considered by Roth* in a later paper and by Franklin.{ This 
latter equation obviously includes (1) as a special case, and, 
since it presents little more difficulty than (1), we shall consider 
(2) first. Our problem then is to find necessary and sufficient con- 
ditions that there may exist solutions X of (2) which are expres- 
sible as polynomials in A, and to give a simple method for find- 
ing such solutions in case any exist. We shall not employ the 
theory of elementary divisors which leads to a somewhat com- 
plicated argument, nor the canonical form, since in actual prac- 
tise the reduction of a matrix to the canonical form is itself 
quite tedious. On the other hand, we shall show that the prin- 
cipal idempotent and nilpotent matrices associated with the 
matrix A lend themselves quite readily to a solution of the 
problem. 


2. The Principal Idempotent and Nilpotent Matrices Assoct- 
ated with A.t If Xisa scalar, the equation of degree n, 


A| =0, 


is called the characteristic equation of A. As is well known, A 
satisfies its own characteristic equation, but often this is not 
the equation of lowest degree that A satisfies. In fact, if (A) 
denotes the highest common factor of all the (m—1)-rowed 
minors of A—XI, and if we denote by ¢(A) the polynomial 
(A) /A(A), then (A) = 0 is the equation of lowest degree which A 
satisfies. We shall hereafter refer to ¢(A) as the reduced character- 
istic function and to the equation $¢(A)=0 as the reduced or 
minimum equation of A. 

Let us suppose that (A) when resolved into linear factors is 
of the form 


(3) 


* Roth, On the equation P(A,X)=0 in matrices, Transactions of this 
Society, vol. 35 (1933), pp. 689-708. 

Franklin, loc. cit. 

t Wedderburn, Lectures on Matrices, American Mathematical Society 
Publications, 1934, pp. 23-29. 


1935-] CERTAIN MATRIC EQUATIONS 739 


where the a’s are distinct. For r>1, we write 


kA) = 
(A) 


(4=1,---, 97). 


We can then determine two scalar polynomials M;(X) and N;(\) 
of degrees not exceeding v;—1 and N—v;—1, respectively, such 
that 


+ ai)” = 1. 
If we write 
= M,(A)A(A), (i= r), 


and for ¢;(A) write ¢;, then ¢; is the principal idempotent ele- 
ment of A corresponding to the root a;. The matrices ¢; satisfy 
the conditions 


(4) oF gi, 


for any positive integer k; 
t=1 


Moreover, these ¢’s are linearly independent and none is zero. 
Let us now denote by 7; the matric polynomial in A defined by 
the formula 


ni(A) (A ail)p(A), (¢=1, 2, r). 


It is easily shown that these matrices 7; satisfy the conditions 


(6) nt #0, (k <¥%); = 0; 
(7) = ni = Oni; nnj=90, (i #7); 
and, moreover, 
(8) A= > (adi + = + 
i=1 


The matrix 7; is the principal nilpotent element of A correspond- 
ing to the root a;. If r=1, so that @(A) reduces to (A—a)”, 
we take 6 = 1, n= A —al, and it will be seen at once that such of 


* Here and throughout the remainder of the paper we have designated the 
scalar matrix a;IJ merely by the symbol a,. 


= 
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the conditions (4), - - - , (8) as are applicable hold also in this 
case. 


3. Lemma 1. If the $’s and n’s are defined as in the preceding 
section, and if f(A) is any polynomial in A with scalar coefficients, 
then 


(9) fA) = +n) = 


where the f; are polynomials determined uniquely by f(A). Con- 
versely, any such expression is equal to a polynomial in A. 


For r>1, we have from (8), in view of (4), (5), and (7), 
A? = +n)’, 
and, in general, if k is any positive integer 


Hence, if f(A) is a polynomial with scalar coefficients, it follows 
that 


(A) = of(ai + 05). 


If now f(a:+7;) be expanded and written as a polynomial f;(n;) 
in n:, we have (9). Moreover, it is clear from the manner in 
which they arise that the f; are unique.* 

The converse follows at once since each of the @’s and 7’s is 
a polynomial in A. The lemma also holds obviously for r=1. 


LEMMA 2. Let X be a matrix expressible as a polynomial in A, 
and therefore of the form on the right in (9). Necessary and suffi- 
cient conditions that X may be a solution of (2) are that the f; 
satisfy the relations 


(10) lai + ni) = 0, (¢= 1,---,79). 


For, by Lemma 1, F;(A) is expressible in the form 


Tr 


= > (ai + ni), (j = 0,---,m). 


i=1 


* Otherwise, if we assume that DY oifi(ni) =) ¢ivi(n:), it will follow, just as 
in the proof of the second part of Lemma 2, that f;=yi, @=1,---,7). 


r r 
_ 
m 
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Also, if X =f(A) =~’ -o¢-f-(n;), we have as above 
On substituting into (2) and making use of (4) and (5), we have 
(11) Fila + nid = 0. 
j=0 


The conditions (10) are therefore obviously sufficient. To prove 
that they are also necessary, multiply (11) through by ¢,, and 
we obtain 


Now, by hypothesis, the f; and all of the F’s are polynomials. 
Hence, since 7,;’'=0, each of the equations (12) reduces to the 
form 


(13) + + = 0. 


On multiplying through by 7; and recalling that ¢6m¥=(¢m;)* 
we obtain 


ao(dini) + + + = 0. 
Now the minimum equation of 7;=9.7; is 
(oni)? = 0, 
so that the matrices $:7;, - - - , (6:m;)”* are linearly independ- 
ent. Hence, 
=4,,2=0, 
and it follows from (13) that we have also a,,_: =0. The lemma is 


therefore established. 


4. The Matric Equation P(A, ~’) =0. If for brevity we denote 
the left member of (2) by P(A, X), the conditions (10) can 
clearly be written in the form 


(10’) Pla: + ni, fi(ni)] = 0. 


Since 7; behaves in all ways precisely as a scalar variable &, 
except that 7,”*= 0, and since a necessary and sufficient condi- 
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tion that (10’) hold is that the left member be divisible by 
n:”', we may state the following theorem. 


THEOREM 1. Jf a:, (¢=1,---, 7), are the distinct roots, of 
multiplicities vi, respectively, of the minimum equation of A, 
necessary and sufficient conditions that a matrix X, expressible 
as a polynomial in A, satisfy (2) are that there exist polynomials 
fiE), (G=1, - - - , such that the equations 


(14) Plact+é f()]=0, (i=1,---,7), 
possess roots of multiplicities v;, respectively, atE=0. If such poly- 
nomials f; exist, then X = oifi(ni) is a solution of (2). 


Assuming that the conditions of Theorem 1 are satisfied, let 
us now proceed to find the polynomials f;. Since 7;‘=0, the 
typical polynomial f; may be taken in the form 


(15) = Xo + mE + 
From (14) we have then as a first necessary condition 
(16) P(ai, %o) = 0. 


Any root of this equation will serve as xo. If v;=1, f; is deter- 
mined. If »v;>1, we differentiate (14) as to — and put &=0, 
whence 


(17) Pa, + “Pz, = 0, 


where the subscripts denote partial differentiation. If P.,+0, 
that is, if there exists a simple root of (15) which may be chosen 
for xo, then x; is uniquely determined. If, however, for every root 
xo of (16), we have P,,=0, it is necessary also that P,;=0 at 
£=0. Differentiating (14) a second time and putting £=0, we 
have 

Puja; + + + 2P2%2 = 0, 


and so on. 

It will be noticed that the successive coefficients x1, %2,---, 
of f; enter these equations for the first time with the coefficient 
P,,. This leads to the Theorem of Franklin.* 


THEOREM 2. A sufficient condition that the equation P(A, X)=0 
may have a solution X, expressible as a polynomial in A, is that 


* Franklin, loc. cit. 
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for each multiple root a; of the minimum equation of A, the equa- 
tion P(a;, xo) =0 have at least one simple root. 


5. Solution of the Matric Equation P(X)=A. In particular, 
if our matric equation is of the form (1), the equation (14) be- 
comes 


(18) P[fi(é)] = ai + &. 
In this case, (17) reduces to 
= 


so that the condition P,,4+0 is not only a sufficient condition 
that x; may exist, but a necessary one also. Since each simple 
root of (16) yields a unique polynomial f; corresponding to it, 
it follows from the uniqueness feature of Lemma 1 that we are 
led to the Theorem of Roth.* 


THEOREM 3. Let A be a square matrix whose minimum equation 
has the s multiple roots a, - - -,a,, and the t simple roots O41, - 
G44. The matric equation P(X)=A has a solution for X as a 
polynomial in A if, and only if, each of the equations 


P(x) = ai, (i=1,---,5), 
has at least one simple root. If these equations have respectively 


M1, Ms Simple roots and tf °° denote the number 
of distinct roots of the equations 


P(x)=a:i, 
the total number of solutions of (1) for X as a polynomial in A 1s 
* Ms+t- 


CorROLLARY. If the minimum equation of A has all roots distinct, 
the matric equation P(X)=A is always solvable for X as a poly- 
nomial in A. If w; denotes the number of distinct roots of the equa- 
tion P(x) =a:, the total number of such solutions 1s pipe -- >: ur. 


Tur UNIVERSITY OF NorTH CAROLINA 


* Roth, Transactions of this Society, vol. 30 (1928), pp. 579-596. 
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DISTRIBUTION OF MASS FOR AVERAGES OF 
NEWTONIAN POTENTIAL FUNCTIONS 


BY J. M. THOMPSON 


1. Introduction. It has been proved that the average of a po- 
tential function over a spherical volume and the average of a 
potential function over a spherical surface are themselves po- 
tential functions.* This paper is concerned with the determina- 
tion of the distribution of mass for these two spherical averages; 
in addition, the distribution of mass for more general averages 
is obtained. 


2. Preliminary Theorems. The problem is solved by means of 
a theorem on the change of the order of integration of an iter- 
ated Stieltjes integral. First it is necessary to state some pre- 
liminary theorems. We recall the following elementary theorem. 

If h(Q) is continuous in Q and g(e) is a distribution of positive 
mass, bounded in total amount, and lying ona bounded set F (which 
may be taken as closed without loss of generality), then, for the 
integral over the whole of space, w, 


(1) | f — X < 


where the summation 1s extended over all the meshes of a lattice Ls, 
of diameter <6, Q; is a point of the mesh e;, ws is the oscillation of 
h(Q) ona subset of F of diameter <5, and a=g(F). 

This theorem will be applied to the integral 


f P), 


where h*(M, Q) is continuous in M, Q, and g(e, P) and F are 
bounded independently of P, so that ws; and a in (1) are inde- 
pendent of M, P. 


THEOREM 1. Jf g(e, P) is a distribution of positive mass, 
bounded independently of P, on a set F bounded independently 


*G. C. Evans, On potentials of positive mass, Transactions of this Society, 
vol. 37 (1935), p. 250. 
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of P, and if f(e) ts a distribution of positive mass, bounded in all 
space, and if g(e, P) is summable with respect to f(e), then 


= f se, Prafter) 


is a distribution of positive mass bounded in total amount and lying 
on the set F. 


In this theorem, it is not required that g(e, P) be continuous 
in P, so that the integral must be interpreted as a generalized 
or Daniell integral with respect to f(e). But in the theorems 
which follow, g(e, P) is taken as continuous in P for every mesh 
of L. 

To prove that G(e) is a distribution of positive mass, we must 
show that G(e) is a non-negative, absolutely additive function 
of point sets (only Borel measurable point sets are considered). 

The first requirement follows immediately from the fact 
that g(e, P) and f(e) are non-negative for all P and e. In 
order to exhibit the second condition, let e=e:+e.+ - - - and 
--- +e,; as P) is a distribution of positive 
mass, we have 

g(e, P) = lim g(e, , P), with g(er41, P) = gles, P). 

Since g(e, P) is, as a function of P, the limit of an increas- 
ing sequence of functions, the order of integration and passing 
to the limit may be interchanged; so that 


G(e) 


J ste, Pranter) = tim Paster) 


lim [G(e1) + - - + G(¢n)]= G(er) + Glee) + 


Hence G(e) is a distribution of positive mass. Also G(e) is a dis- 
tribution of mass lying on F; that is, G(e) =0, if e- F= 0, because 
of the hypothesis that g(e, P) =0, if e- F=0. Finally, G(e) < [up- 
per bound of g(e, P) |-f(w), so bounded. 

We state the generalizations of two theorems proved by H. E. 
Bray; the proofs are omitted as they are essentially the same 
as those given by Bray.* They depend on the inequality (1). 


* H.E. Bray, Elementary properties of the Stieltjes integral, Annals of Mathe- 
matics, vol. 20 (1918-19), pp. 180-185. 
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THEOREM 2. If h¥(M, Q) is a continuous function of M and Q, 
and is bounded in all space, if g(e, P) is a distribution of positive 
mass, bounded independently of P, on a set F bounded independ- 
ently of P, and if g(e, P) is continuous in P for every cell e of a 
net L, then 


K(M, P) = f h¥(M, Q)dg(co, P) 


is continuous in M and P. 


THEOREM 3. If h¥(M, Q) is continuous in M and Q, and 
bounded in all space, if g(e, P) is a distribution of positive mass, 
bounded independently of P, on a set F bounded independently 
of P, and if it is continuous in P for every cell e of a net L, and if 
f(e) is a distribution of positive mass, bounded in total amount, 
lying on a bounded set E (which may be taken as closed without loss 
of generality), then the integrals 


f Paster) | 


exist and are equal. 


and 


We may now state and prove the concluding theorem in this 
series. 


THEOREM 4. If g(e, P) is a positive distribution of mass, 
bounded in total amount independently of P, on a set F bounded 
independently of P, and if it is continuous in P for every cell e 
of the net L, and if f(e) is a distribution of positive mass, bounded 
in total amount and lying on the set E, then 


f f al falco, raster], 


(or both are +2), where fug(e, P)df(ep) is a bounded distribu- 
tion of positive mass lying on the set F. 


= 
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If 


h¥(M,Q) = ——> if — < VN, 


the function 4*%(M,Q) is continuous in M and Q. 
By Theorem 3, we have 


= J seo Paster). 


Let N become infinite; we have 


f al J Paster) |. 


The interchange of integration and passing to the limit in the 
left-hand memberis justified because the integrand is an increas- 
ing function of N, while the definition of the integral of a lower- 
semi-continuous function with respect to a distribution of posi- 
tive mass was used in the right-hand member. Applying this 
definition now to the left-hand member, we have 


J a P) = J seo rafter) |. 


This, combined with Theorem 1, establishes the theorem. 


3. Volume Averages. This theorem just proved enables one to 
exhibit the distribution of mass for the spherical volume aver- 
age of a potential function in a form in which it may be evalu- 
ated; this is an illustration of the advantage sometimes gained 
by working with the more general situation. 

In the theorems that follow, u(Q) is the potential at Q of the 
distribution of positive mass, f(e), which is bounded in total 
amount and lies on a bounded set E. 


1 
=N, ff—>N, 
MQ 
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THEOREM 5. The average of the potential function u(Q) over 
a spherical volume T(r, M), of radius r and center M, 1s a po- 
tential function of a distribution of positive mass with density 
(3/(4xr*))f{ T(r, Q)}. In symbols, 


3 
) u(Q)ae 


3 


Q)}dQ. 


au(r, M) 


(2) 


Let us form this spherical volume average and determine its 
distribution of mass. We have 


3 
J r(r,M) u(Q)a0 


wf ier. 


As the integrand is a lower-semi-continuous function, the order 
of integration may be interchanged, so that 


The inner integral is the potential at P of a sphere of unit den- 
sity with radius r and center at M, which is equal to the potential 
at M of the same sphere with center at P. Hence, 


au(r, M) 


a,(r, M) ep) 


dQ 


r(r,P) 


P) 


where g(e, P) =m3{e-T'(r, P) os and m3; means the three-dimen- 
sional measure of the set indicated. The quantity g(e, P) is evi- 
dently continuous in P for every measurable set e. 

By means of Theorem 4, this volume average may be ex- 
pressed in a form which enables the mass distribution to be eval- 
uated, 


3 1 
df(ep) 
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Consider the integral, 


(3) a,(r, M) = 


ve) = ff mte-re, P)}df(er), 


where ¢ is an arbitrary bounded set measurable Borel. This func- 
tion is an absolutely continuous function of e; in fact 


v(e) S m;(e)-f(E), 


and v(e) is completely additive by Theorem 1. 
The integrand of v(e) may be expressed as a Lebesgue inte- 
gral, 


m3{e-T'(r, P)} 1dR = P)dR, 


e 


where we may define B(R, P) as 1, for RP <r, and 0, for RP=r. 
In this way B(R, P) is the limit of an increasing sequence of 
continuous functions of R and P, and we have 


v(e) = f anter) f P)dR -{ ar BR, P)df(ep) 
(4) w e e w 
= f sre, R)} dR. 


Substituting this result in (3) and making use of the definition 
of the integral of a lower-semi-continuous function, we have 


3 1 
{Tr 
a,(r, M) al f strc R)}aR | 


lim | 2*(M, R)jaR |. 


4rr® N=0 


As h*(M, Q) is bounded and continuous, and v(e) is absolutely 
continuous, we may change the Stieltjes integral to a Lebesgue 
integral. This gives 


3 


| 
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3 1 
Ola, 


and the proof is complete. 

It should be pointed out that the work of this section holds 
for the average over any three-dimensional open set. Let s(Q) 
be an open set (therefore of positive spatial measure), s(/) the 
set obtained by displacement of s(Q) as a rigid body, without 
rotation, so that O falls on M, and let s’(Q) be the reflection of 
the set s(M) through the midpoint of the line MQ. 


THEOREM 6. The average of the potential function u(Q) over the 
set s(M) ts a potential function of a distribution of positive mass 
with density (1/[mss(M) ]) f{s’(Q)}. In symbols, 


1 
1 
ae. 


The construction of the set s’(P) gives the potential at P of 
the set s(/), of unit density, equal to the potential at M of the 
set s’(P), of unit density; hence all the transformations made 
in this section on the spherical volume average are valid for the 
average over the set s(/). 


4. Spherical Surface Average. By means of Theorem 4, we may 
also determine the mass function for the average of a potential 
function over a spherical surface. 


THEOREM 7. The average of the potential function u(Q) over the 
spherical surface C(r, M), of radius r and center M, is a potential 
function of the distribution of positive mass, 


1 
al m2 {e-C(r, P) } df(er). 
In symbols, 


1 
A,(r, M) = u(Q)dQ 


J Cr,M) 


1 1 


1935-] AVERAGES OF POTENTIAL FUNCTIONS 751 


Treating the spherical surface average in the same manner as 
we have treated the volume average, we obtain 


1 1 1 
(r,M) C(r,M) w 


1 

= df(e — d 
f af(er) 40 

= 


1 1 


where g(e, P)=mz{e-C(r, P) } . For a given P, this function is 
additive and bounded for cells e of a three-dimensional lattice, 
and hence can be extended by definition uniquely to all sets 
spatially measurable Borel. 

As g(e, P) is a continuous function of P for every cell e of a 
net L, Theorem 4 applies, and we may thereby express this aver- 
age in a form that exhibits its mass function in terms of C(r, P) 


and f(e), 
1 1 
MO f P)jaf(er) |. 


(5) A,(r, M) = 

This result requires no restriction on f(e) other than those we 
have already stated. However, we shall state also a special case 
of Theorem 7. 


THEorEM 8. If f{I'(r, Q) } is differentiable with respect tor, and, 
for a fixed neighborhood of the given value of r, Of { I'(n, Q) } /or 
1s bounded independently of the point Q, then the average of the 
potential function u(Q) over the spherical surface C(r, M) ts a 
potential function of a distribution of positive mass with density 
(1/(4ar?) of {T'(r, Q)}/dr. In symbols, 

1 f af{t(,Q)} 40. 


(6) 


We have, for a rectangular cell e, 
ms{e-[I'(r:, P) — T(r, P)]} 


(7) m2{e-C(r, P)} = lim 


where <7j41 <7; and lim;_,7; =r. 


= 
— 
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Using the results given in (4), we have the following equality, 


j= [r(r:, P) — T(r, 


_f 


df(ep) 


The integrand of the left-hand member belongs to a sequence 
of measurable, uniformly bounded functions, as a function of P, 
whose limit exists when 7 becomes infinite; so we let 7 become 
infinite and interchange the order of integration and pass to 
the limit for the left-hand member. The same considerations 
hold for the integrand of the right-hand member as a function 
of Q. Using (13), we have 


af{T(r, Q)} 
J or a 


f P)} df(er) = 


The quantity of { T(r, Q) } /ar is non-negative. Hence we may 
substitute this last equation in (5) and change the Stieltjes in- 
tegral into a Lebesgue integral as we did above for the volume 
average. Thus we have established the theorem. 
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ERRATUM 


In my paper entitled On the summability of a certain class of 
series of Jacobi polynomials (this Bulletin, vol. 41 (1935), pp. 
541-549), the following change should be made; it conforms 
with the last proofs that I had seen. 

Page 544, 8th line from the bottom: read S8) instead of S. 


A. P. CowGILL 
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